FIGURE 1

GTTACTCGGTGGTGGCGGAGTCTACGGAAGCCGTTTTCGCTTCACTTTTCCTGGCTGTAGAGC 

GCTTTCCCCCTGGCGGGTGAGAGTGCAGAGACGAAGGTGCGAGATGAGCACTATGTTCGCGGA 

CACTCTCCTCATCGTTTTTATCTCTGTGTGCACGGCTCTGCTCGCAGAGGGCATAACCTGGGT 

CCTGGTTTACAGGACAGACAAGTACAAGAGACTGAAGGCAGAAGTGGAAAAACAGAGTAAAAA 

ATTGGAAAAGAAGAAGGAAACAATAACAGAGTCAGCTGGTCGACAACAGAAAAAGAAAATAGA 

GAGACAAGAAGAGAAACTGAAGAATAACAACAGAGATCTATCAATGGTTCGAATGAAATCCAT 

GTTTGCTATTGGCTTTTGTTTTACTGCCCTAATGGGAATGTTCAATTCCATATTTGATGGTAG 

AGTGGTGGCAAAGCTTCCTTTTACCCCTCTTTCTTACATCCAAGGACTGTCTCATCGAAATCT 

GCTGGGAGATGACACCACAGACTGTTCCTTCATTTTCCTGTATATTCTCTGTACTATGTCGAT 

TCGACAGAACATTCAGAAGATTCTCGGCCTTGCCCCTTCACGAGCCGCCACCAAGCAGGCAGG 

TGGATTTCTTGGCCCACCACCTCCTTCTGGGAAGTTCTCTT^ACTCAAGAACTCTTTATTTT. 

CTATCATTCTTTCTAGACACACACACATCAGACTGGCAACTGTTTTGTAGCAAGAGCCATAGG 

TAGCCTTACTACTTGGGCCTCTTTCTAGTTTTGAATTATTTCTAAGCCTTTTGGGTATGATTA 

GAGTGAAAATGGCAGCCAGCAAACTTGATAGTGCTTTTGGTCCTAGATGATTTTTATCAAATA 

AGTGGATTGATTAGTTAAGTTCAGGTAATGTTTATGTAATGAAAAACAAATAGCATCCTTCTT 

GTTTCATTTACATAAGTATTTTCTGTGGGACCGACTCTCAAGGCACTGTGTATGCCCTGCAAG 

TTGGCTGTCTATGAGCATTTAGAGATTTAGAAGAAAAATTTAGTTTGTTTAACCCTTGTAACT 

GTTTGTTTTGTTGTTGTTTTTTTTTCAAGCCAAATACATGACATAAGATCAATAAAGAGGCCA 

AATTTTTAGCTGTTTTATGTACAAGGAGAGATCTGTTTCATTTTGTTTTGCCGTATTTCTAGA 

TATAAGTTTTAGCATGGGCCAGGAAGGACTAAAATAAAAGTTTTTAAGGTACAAAAAAAAAAA 

AAAA 
FIGURE 2

MSTMFADTLLIVFISVCTALLAEGITWVLVYRTDKYKRLKAEVEKQSKKLEKKKETITESAGR 
QQKKKIERQEEKLKNNNRDLSMVRMKSMFAIGFCFTALMGMFNSIFDGRVVAKLPFTPLSYIQ 
GLSHRNLLGDDTTDCSFIFLYILCTMSIRQNIQKILGLAPSRAATKQAGGFLGPPPPSGKFS 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites:

amino acids 103-109, 163-169 



cAMP- and cGMP-dependent protein kinase phosphorylation site:

amino acids 53-57 
FIGURE 3

AGCCGGGGGCGGGTTTGAAGACGCGTCGTTGGGTTTTGGAGGCCGTGAAACAGCCGTTTGAGT 
TTGGCTGCGGGTGGAGAACGTTTGTCAGGGGCCCGGCCAAGAAGGAGGCCCGCCTGTTACGAT 
GGTGTCCATGAGTTTCAAGCGGAACCGCAGTGACCGGTTCTACAGCACCCGGTGCTGCGGCTG 
TTGCCATGTCCGCACCGGGACGATCATCCTGGGGACCTGGTACATGGTAGTAAACCTATTGAT 
GGCAATTTTGCTGACTGTGGAAGTGACTCATCCAAACTCCATGCCAGCTGTCAACATTCAGTA 
TGAAGTCATCGGTAATTACTATTCGTCTGAGAGAATGGCTGATAATGCCTGTGTTCTTTTTGC
CGTCTCTGTTCTTATGTTTATAATCAGTTCAATGCTGGTTTATGGAGCAATTTCTTATCAAGT 
GGGTTGGCTGATTCCATTCTTCTGTTACCGACTTTTTGACTTCGTCCTCAGTTGCCTGGTTGC 
TATTAGTTCTCTCACCTATTTGCCAAGAATCAAAGAATATCTGGATCAACTACCTGATTTTCC 
CTACAAAGATGACCTCCTGGCCTTGGACTCCAGCTGCCTCCTGTTCATTGTTCTTGTGTTCTT 
TGCCTTATTCATCATTTTTAAGGCTTATCTAATTAACTGTGTTTGGAACTGCTATAAATACAT 
CAACAACCGAAACGTGCCGGAGATTGCTGTGTACCCTGCCTTTGAAAGCACCTCCTCAGTACG 
TTTTGCCAACCTATGAAATGGCCGTGAAAATGCCTGAAAAAGAACCACCACCTCCTTACTTAC 
CTGCCTGAAGAAATTCTGCCTTTGACAATAAATCCTATACCAGCTTTTTGTTTGTTTATGTTA 
CAGAATGCTGCAATTCAGGGCTCTTCAAACTTGTTTGATATAAAATATGTTGTCTTTTGTTTA 
AGCATTTATTTTCAAACACTAAGGAGCTTTTTGACATCTGTTAAACGTCTTTTTGTTTTTTTG 
TTAAGTCTTTTACATTTTAATAGTTTTTGAAGACAATCTAGGTTAAGCAAGAGCAAAGTGCCA 
TTGTTTGCCTTTAATTGGGGGGTGGGAAGGGAAAGAGGGTACTTGCCACATAGTTTCCTTTTT 
AACTGCACTTTCTTTATATAATCGTTTGCATTTTGTTACTTGCTACCCTGAGTACTTTCAGGA 
AGACTGACTTAAATATTCGGGGTGAGTAAGTAGTTGGGTATAAGATCTGAACTTTTCATCTGC 
AGAGGCAAGAAAAATATTTGACATTGTGACTTGACTGTGGAAGATGATGGTTGCATGTTTCTA 
GTTTGTATATGTTTCCATCTTTGTGATAAGATGATTTAATAAATCTCTTTAAATACTAAAAAA 
AAAAAAAAA 
FIGURE 4

MVSMSFKRNRSDRFYSTRCCGCCHVRTGTIILGTWYMVVNLLMAILLTVEVTHPNSMPAVNIQ 
YEVIGNYYSSERMADNACVLFAVSVLMFIISSMLVYGAISYQVGWLIPFFCYRLFDFVLSCLV 
AISSLTYLPRIKEYLDQLPDFPYKDDLLALDSSCLLFIVLVFFALFIIFKAYLINCVWNCYKY 
INNRNVPEIAVYPAFESTSSVRFANL 

Important features of the protein: 

Transmembrane domain (possible type II transmembrane protein):

amino acids 30-49, 81-100, 111-131, 158-175 

N-glycosylation site. 

amino acids 9-13 

Tyrosine kinase phosphorylation sites. 

amino acids 8-16, 193-202 

N-myristoylation site. 

amino acids 68-74 



FIGURE 5

CCCGCTGGCCCGTCAGTGCTCTCCCCGTCGTTTGCCCTCTCCAGTTCCCCCAGTGCCTGCCCT 
ACGCACCCCGATGGCGGAGCTGCGGCCTAGCGGCGCCCCCGGCCCCACCGCGCCCCCGGCCCC 
TGGCCCGACTGCCCCCCCGGCCTTCGCTTCGCTCTTTCCCCCGGGACTGCACGCCATCTACGG 
AGAGTGCCGCCGCCTTTACCCTGACCAGCCGAACCCGCTCCAGGTTACCGCTATCGTCAAGTA 
CTGGTTGGGTGGCCCAGACCCCTTGGACTATGTTAGCATGTACAGGAATGTGGGGAGCCCTTC 
TGCTAACATCCCCGAGCACTGGCACTACATCAGCTTCGGCCTGAGTGATCTCTATGGTGACAA 
CAGAGTCCATGAGTTTACAGGAACAGATGGACCTAGTGGTTTTGGCTTTGAGTTGACCTTTCG 
TCTGAAGAGAGAAACTGGGGAGTCTGCCCCACCAACATGGCCCGCAGAGTTAATGCAGGGCTT 
GGCACGATACGTGTTCCAGTCAGAGAACACCTTCTGCAGTGGGGACCATGTGTCCTGGCACAG 
CCCTTTGGATAACAGTGAGTCAAGAATTCAGCACATGCTGCTGACAGAGGACCCACAGATGCA 
GCCCGTGCAGACACCCTTTGGGGTAGTTACCTTCCTCCAGATCGTTGGTGTCTGCACTGAAGA 
GCTACACTCAGCCCAGCAGTGGAACGGGCAGGGCATCCTGGAGCTGCTGCGGACAGTGCCTAT 
TGCTGGCGGCCCCTGGCTGATAACTGACATGCGGAGGGGAGAGACCATATTTGAGATCGATCC 
ACACCTGCAAGAGAGAGTTGACAAAGGCATCGAGACAGATGGCTCCAACCTGAGTGGTGTCAG 
TGCCAAGTGTGCCTGGGATGACCTGAGCCGGCCCCCCGAGGATGACGAGGACAGCCGGAGCAT 
CTGCATCGGCACACAGCCCCGGCGACTCTCTGGCAAAGACACAGAGCAGATCCGGGAGACCCT 
GAGGAGAGGACTCGAGATCAACAGCAAACCTGTCCTTCCACCAATCAACCCTCAGCGGCAGAA 
TGGCCTCGCCCACGACCGGGCCCCGAGCCGCAAAGACAGCCTGGAAAGTGACAGCTCCACGGC 
CATCATTCCCCATGAGCTGATTCGCACGCGGCAGCTTGAGAGCGTACATCTGAAATTCAACCA 
GGAGTCCGGAGCCCTCATTCCTCTCTGCCTAAGGGGCAGGCTCCTGCATGGACGGCACTTTAC 
ATATAAAAGTATCACAGGTGACATGGCCATCACGTTTGTCTCCACGGGAGTGGAAGGCGCCTT 
TGCCACTGAGGAGCATCCTTACGCGGCTCATGGACCCTGGTTACAACTCTGAACCTATCCTCG 
GAGCTCTGCCCTCCCGTCCTGGAACGTCTTTCTGCCCTGAGGAGAGGGTAGTCAGCATCTCCA 
ATTTTCAGCAGCTCAAGAACCTTGGCCCCCACAGGACTTCGCAGATGTCACATTGCCCCTCAG 
TCCCCTGAATGCCCTTCGGACCCAACCCCAATTCCCCAAGCCCCTGACCCCCTAGCTGCCGGG 
GTTCCCACTCCCAGTGCCACAACCCCCTCACCTCCCCTGGCAGCCCCTCAGCGAGCCTGAGGC 
CCAGCACCCGCTGGCTCCCCAGCACATGGTCCCCTCCCATGGGCTGTTGCCCAGGGAACCGGG 
GCGCGGTGGGAACGAGCTGCTGGCCTCGGCATGTTTCAATAAAGTTGCTGTGCTGGGAG 
FIGURE 6

MAELRPSGAPGPTAPPAPGPTAPPAFASLFPPGLHAIYGECRRLYPDQPNPLQVTAIVKYWLG 
GPDPLDYVSMYRNVGSPSANIPEHWHYISFGLSDLYGDNRVHEFTGTDGPSGFGFELTFRLKR 
ETGESAPPTWPAELMQGLARYVFQSENTFCSGDHVSWHSPLDNSESRIQHMLLTEDPQMQPVQ 
TPFGVVTFLQIVGVCTEELHSAQQWNGQGILELLRTVPIAGGPWLITDMRRGETIFEIDPHLQ 
ERVDKGIETDGSNLSGVSAKCAWDDLSRPPEDDEDSRSICIGTQPRRLSGKDTEQIRETLRRG 
LEINSKPVLPPINPQRQNGLAHDRAPSRKDSLESDSSTAIIPHELIRTRQLESVHLKFNQESG
ALIPLCLRGRLLHGRHFTYKSITGDMAITFVSTGVEGAFATEEHPYAAHGPWLQL 

Important features: 
N-glycosylation site. 

amino acids 265-268 
FIGURE 7

CGCGAATGAAGTTTGCATTTTCCTCTGTTCTTGAGCCCAGCTTCTTCTCGTCTCCCACCCCAG 
CTTCCCGGCATTGGAAGAAGGGACCGTCCTCTTCCTTGTCTTGGCCACCCAAATCCTGGTATC 
GAAAGGGTTGAACGGACCGGAAGTGTGCAGCAGCGACGGGTCCCCAGCTAATCGACGCCGGAA 
GTAGCAATTACTAGACAAGCATTCCGCCGCCGGCTTCGCTATGGCGGCAATTCCCCCAGATTC 
CTGGCAGCCACCCAACGTTTACTTGGAGACCAGCATGGGAATCATTGTGCTGGAGCTGTACTG 
GAAGCATGCTCCAAAGACCTGTAAGAACTTTGCTGAGTTGGCTCGTCGAGGTTACTACAATGG 
CACAAAATTCCACAGAATTATCAAAGACTTCATGATCCAAGGAGGTGACCCAACAGGGACAGG 
TCGAGGTGGTGCATCTATCTATGGCAAACAATTTGAAGATGAACTTCATCCAGACTTGAAATT 
CACGGGGGCTGGAATTCTCGCAATGGCCAATGCGGGGCCAGATACCAATGGCAGCCAGTTCTT
TGTGACCCTCGCCCCCACCCAGTGGCTTGACGGCAAACACACCATTTTTGGCCGAGTGTGTCA 
GGGCATAGGAATGGTGAATCGCGTGGGAATGGTAGAAACAAACTCCCAGGACCGCCCTGTGGA 
CGACGTGAAGATCATTAAGGCATACCCTTCTGGGTAGACTTGCTACCCTCTTGAGCAGCTCTT 
CTGAGATGGCCCCAGTGAACCAGCTTCTAGATGACATAGAATGACATGTAATGCTAAATTTCA 
TTTTGGCTTTGCAAGTCATGAAGCTTAGGAGGCCTGGCATCTTGGGTGAGTTAGAGATGGAAG 
TACATTTTAATAGGATGCTTCTTTTCTCTTCCCCCAGTGCCTAGGTTGCCAGAGCATTTGCAC 
AAATGCCCCTGTTTATC7^ATAGGTGACTACTTACTACACATGAACCATAATGCTGCTTCTTGT 
GCATGTCTGCTCTGATATACGTCGAACAATGTAGCAGCGACTGTCATTTCTCAGTGGTTTTGC 
CTAACCAAACTTCTTCCTAAGGAGATTTATATTCTGGCCTACACAGCAGTCCTTGATGGCTGA 
CAGCCACAGAATTCCAAACCAAGTAGTGTCTGTCAGCCCTCTTAACTCTGTGCACGCCCTATT 
TCAGTCTTTTACATTTGTTCTTCTAGGGAATGTATGCATCTCTATATATATTTTCCCTCTCAA 
AACCAGAACATCAACAGTGCTGTTTCTGACACTTCAGACATCCCACGCAAAGCCACATTGAAT 
TTTTGCCAAATGAAAAACACATCCAACAATCAAGTTTCTAAGAAGGTGTCAAGTGGGGAATAA 
TAATAATGTATAATAATCAAGAAATTAGTTTATTAAAAGGAAGCAGAAGCATTGACCATTTTT 
TCCCAGAGAAGAGGAGAAATCTGTAGTGAGCAAAGGACAGACCATGAATCCTCCTTGAGAAGT 
AGTACTCTCAGAAAGGAGAAGCGCCACTCAAGTTCTTTTAACCCAAGACTTTAGAGAAATTAG 
GTCCAAGATTTTTATATGTTCAGTTGTTTATGTATAAAAATAACTTTCTGGATTTTGTGGGGA 
GGAGCAGGAGAGGAAGGAAGTTAATACCTATGTAATACATAGAAACTTCCACAATAAAATGCC 
ATTGATGGTTAAAAAAAAAAAAAAAAAAA 
FIGURE 8

MAAIPPDSWQPPNVYLETSMGIIVLELYWKHAPKTCKNFAELARRGYYNGTKFHRIIKDFMIQ 
GGDPTGTGRGGASIYGKQFEDELHPDLKFTGAGILAMANAGPDTNGSQFFVTLAPTQWLDGKH 
TIFGRVCQGIGMVNRVGMVETNSQDRPVDDVKIIKAYPSG 

Important features: 
N-glycosylation sites: 

amino acids 49-52, 108-111 

N-myristoylation sites: 

amino acids 64-69, 69-74, 143-148 

Cyclophilin-type peptidyl-prolyl cis-trans isomerase signature:

amino acids 48-65 
FIGURE 9

CGGACGCGTGGGCGCGCGCGAGCGCAGCGGTGGGAGGCGGCGACCAGCCGGTTGAGGCCCCAG 

GCTTGGCCTCACCACAATGTGGCACGAGGCTCGGAAGCATGAGCGGAAGCTTCGAGGCATGAT 

GGTCGACTACAAGAAGAGGGCGGAGCGGAGACGGGAGTATTATGAAAAGATCAAGAAGGACCC 

AGCCCAGTTCCTGCAGGTACATGGCCGAGCTTGCAAGGTGCACCTGGATTCTGCAGTCGCCCT 

GGCCGCTGAGAGCCCTGTTAATATGATGCCCTGGCAGGGGGACACCAACAACATGATTGACCG 

ATTCGATGTCCGTGCCCACCTGGACCACATCCCCGACTACACCCCCCCTCTGCTCACCACCAT 

CTCCCCAGAACAGGAGTCGGACGAACGGAAGTGTAACTACGAGCGCTACAGAGGCCTGGTGCA 

GAACGACTTTGCCGGCATCTCAGAGGAGCAGTGCCTGTACCAGATCTACATTGATGAGTTGTA 

CGGAGGCCTCCAGAGACCCAGCGAAGATGAGAAGAAGAAGCTGGCAGAGAAGAAGGCTTCCAT 

CGGTTATACCTACGAGGACAGCACGGTGGCCGAGGTAGAGAAGGCGGCAGAAAAGCCAGAGGA 

GGAGGAGTCAGCGGCCGAGGAGGAGAGCAACTCGGACGAAGATGAGGTCATCCCCGACATCGA 

CGTGGAGGTGGACGTGGATGAATTGAACCAGGAGCAGGTGGCAGATCTCAACAAACAGGCCAC 

GACTTATGGCATGGCCGACGGTGACTTCGTCAGGATGCTCCGGAAAGACAAGGAGGAGGCAGA 

GGCCATCAAGCATGCCAAGGCTCTTGAGGAGGAGAAGGCCATGTACTCGGGACGCCGCTCTCG 

ACGCCAGCGGAGAGAGTTTCGGGAGAAGCGGCTGAGGGGTCGCAAGATCAGCCCACCCAGCTA 

TGCCCGCCGAGACAGCCCCACCTATGACCCCTATAAGCGGTCACCCTCGGAGTCCAGCTCAGA 

GTCCCGCTCCCGCTCCCGCTCCCCGACCCCGGGCCGCGAGGAGAAGATCACGTTCATCACCAG 

TTTTGGGGGCAGCGATGAGGAGGCAGCCGCAGCCGCTGCTGCCGCAGCAGCATCAGGAGTCAC 

CACAGGGAAGCCCCCCGCACCTCCCCAGCCTGGCGGCCCCGCCCCGGGACGTAATGCCAGCGC 

CCGCCGCCGCTCCTCCTCCTCCTCCTCCTCCTCTTCTGCCTCGAGGACCTCCAGCTCCCGCTC 

CAGCTCTCGCTCCAGCTCCCGCTCTCGCCGTGGTGGGGGCTACTACCGTTCCGGCCGCCACGC 

CCGCTCCCGGTCCCGCTCCTGGTCCCGCTCCCGCTCCCGCTCCCGGCGCTATTCCCGGTCCCG 

TAGCCGTGGCCGGCGGCACTCAGGTGGGGGCTCCCGAGACGGACACCGGTACTCCCGCTCGCC 

CGCCCGGCGTGGTGGTTACGGGCCCCGGCGCAGAAGCAGGAGCCGCTCCCACTCAGGGGACCG 

CTACAGGCGGGGCGGCCGGGGCCTCAGGCACCACAGCAGTAGCCGCAGCCGCAGCAGCTGGTC 

CCTCAGCCCGTCCCGCAGTCGCAGCCTGACTCGCAGCCGCAGCCATAGCCCCAGCCCCAGCCA 

GAGCCGCAGCCGCAGCCGCAGCCGCAGCCAGAGCCCCTCGCCATCACCCGCAAGAGAGAAGCT 

GACCAGGCCGGCCGCGTCCCCTGCTGTGGGCGAGAAGCTGAAAAAGACCGAACCTGCCGCTGG 

TAAAGAGACAGGAGCTGCCAAAGTCACCCAAGCTGACGCCTCAGGAGAAGCTGAAACTGAGGA 

TGCAGAAGGCGCTGAACAGGCAGTTCAAGGCGGATAAGAAGGCGGCACAAGAAAAGATGATCC 

AGCAGGAGCATGAGCGGCAGGAGCGGGAAGACGAGCTTCGAGCCATGGCCCGCAAGATCCGCA 

TGAAGGAGCGGGAACGCCGAGAGAAGGAGAGAGAAGAGTGGGAACGCCAGTACAGCCGGCAGA 

GCCGCTCACCCTCCCCCCGATACAGTCGAGAATACAGCTCTTCTCGAAGGCGCTCAAGGTCCC

GATCCCGAAGCCCCCATTACCGACATTAGGCAGAAGAGTGGGGGGTGGGGAGGACAAGGGGGT 

GGGTAAGGGGCTCAAGCTGTGATGCTGCTGGTTTTATCTCTAGTGAAATAAAGTCAAAAGTTA 

TTTAATTCCCGTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

AAAAAAAA 



WO 01/40466 



PCT/US00/32678 



FIGURE 10 

MWHEARKHERKLRGMMVDYKKRAERRREYYEKIKKDPAQFLQVHGRACKVHLDSAVALAAESP 
VNMMPWQGDTNNMIDRFDVRAHLDHIPDYTPPLLTTISPEQESDERKCNYERYRGLVQNDFAG 
ISEEQCLYQIYIDELYGGLQRPSEDEKKKLAEKKASIGYTYEDSTVAEVEKAAEKPEEEESAA 
EEESNSDEDEVIPDIDVEVDVDELNQEQVADLNKQATTYGMADGDFVRMLRKDKEEAEAIKHA 
KALEEEKAMYSGRRSRRQRREFREKRLRGRKISPPSYARRDSPTYDPYKRSPSESSSESRSRS 
RSPTPGREEKITFITSFGGSDEEAAAAAAAAAASGVTTGKPPAPPQPGGPAPGRNASARRRSS 
SSSSSSSASRTSSSRSSSRSSSRSRRGGGYYRSGRHARSRSRSWSRSRSRSRRYSRSRSRGRR
HSGGGSRDGHRYSRSPARRGGYGPRRRSRSRSHSGDRYRRGGRGLRHHSSSRSRSSWSLSPSR 
SRSLTRSRSHSPSPSQSRSRSRSRSQSPSPSPAREKLTRPAASPAVGEKLKKTEPAAGKETGA 
AKVTQADASGEAETEDAEGAEQAVQGG 

Important features:
N-glycosylation site: 

amino acids 370-373 

Glycosaminoglycan attachment site: 

amino acids 443-446 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

amino acids 159-162, 282-285, 291-294, 374-377, 375-378, 430-433, 
440-443, 466-469 

Casein kinase II phosphorylation site: 

amino acids 149-152, 166-169, 171-174, 187-190, 193-196, 195-198, 
303-306, 307-310, 335-338, 571-574 

N-myristoylation sites: 

amino acids 118-123, 229-234, 350-355, 446-451, 586-591 
Amidation sites: 

amino acids 263-266, 280-283, 438-441 
FIGURE 11

GGTAGGCGCGCCCAGACCTGAGACGGGTTGGGACTGGGCTGCGTCACGCGCGGGCTCTAAGCG 
CCCGGGGCCCCGCCCAGTGGCCGGCACAGCCAATCGCAGCGCGGGAAGGCGGTGGGGGCGGGG 
AAGGCCGCCTGGAAACTTAAATCCCGAGGCGGGCGAACCTGCACCAGACCGCGGACGTCTGTA 
ATCTCAGAGGCTTGTTTGCTGAGGGTGCCTGCGCAGCTGCGACGGCTGCTGGTTTTGAAACAT 
GAATCTTTCGCTCGTCCTGGCTGCCTTTTGCTTGGGAATAGCCTCCGCTGTTCCAAAATTTGA 
CCAAAATTTGGATACAAAGTGGTACCAGTGGAAGGCAACACACAGAAGATTATATGGCGCGAA 
TGAAGAAGGATGGAGGAGAGCAGTGTGGGAAAAGAATATGAAAATGATTGAACTGCACAATGG 
GGAATACAGCCAAGGGAAACATGGCTTCACAATGGCCATGAATGCTTTTGGTGACATGACCAA
TGAAGAATTCAGGCAGATGATGGGTTGCTTTCGAAACCAGAAATTCAGGAAGGGGAAAGTGTT 
CCGTGAGCCTCTGTTTCTTGATCTTCCCAAATCTGTGGATTGGAGAAAGAAAGGCTACGTGAC 
GCCAGTGAAGAATCAGAAACAGTGTGGTTCTTGTTGGGCTTTTAGTGCGACTGGTGCTCTTGA 
AGGACAGATGTTCCGGAAAACTGGGAAACTTGTCTCACTGAGCGAGCAGAATCTGGTGGACTG 
TTCGCGTCCTCAAGGCAATCAGGGCTGCAATGGTGGCTTCATGGCTAGGGCCTTCCAGTATGT 
CAAGGAGAACGGAGGCCTGGACTCTGAGGAATCCTATCCATATGTAGCAGTGGATGAAATCTG 
TAAGTACAGACCTGAGAATTCTGTTGCTAATGACACTGGCTTCACAGTGGTCGCACCTGGAAA 
GGAGAAGGCCCTGATGAAAGCAGTCGCAACTGTGGGGCCCATCTCCGTTGCTATGGATGCAGG 
CCATTCGTCCTTCCAGTTCTACAAATCAGGCATTTATTTTGAACCAGACTGCAGCAGCAAAAA 
CCTGGATCATGGTGTTCTGGTGGTTGGCTACGGCTTTGAAGGAGCAAATTCGAATAACAGCAA
GTATTGGCTCGTCAAAAACAGCTGGGGTCCAGAATGGGGCTCGAATGGCTATGTAAAAATAGC 
CAAAGACAAGAACAACCACTGTGGAATCGCCACAGCAGCCAGCTACCCCAATGTGTGAGCTGA 
TGGATGGTGAGGAGGAAGGACTTAAGGACAGCATGTCTGGGGAAATTTTATCTTGAAACTGAC 
CAAACGCTTATTGTGTT^AGATAAACCAGTTGAATCATGGAGGATCCAAGTTGAGATTTTAATT 
CTGTGACATTTTTACAAGGGTAAAATGTTACCACTACTTTAATTATTGTTATACACAGCTTTA 
TGATATCAAAGACTCATTGCTTAATTCTAAGACTTTTGAATTTTCATTTTTTAAAAAGATGTA 
CAAAACAGTTTGAAATAAATTTTAATTCGTATATA 
FIGURE 12

MNLSLVLAAFCLGIASAVPKFDQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIELHN 
GEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQKFRKGKVFREPLFLDLPKSVDWRKKGYV 
TPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 
VKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDA
GHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKI
AKDKNNHCGIATAASYPNV 

Important features: 
Signal sequence 

amino acids 1-17 

N-glycosylation sites. 

amino acids 2-6, 221-225, 292-296 

N-myristoylation sites. 

amino acids 13-19, 93-99, 136-142, 145-151, 174-180, 177-183, 
180-186, 194-200, 288-294, 324-330 

Eukaryotic thiol (cysteine) proteases cysteine active site. 

amino acids 132-144 

Eukaryotic thiol (cysteine) proteases histidine active site. 

amino acids 275-286 
FIGURE 13

GGCGGCGTCATGTGATCCGCTTCCCTGCTCCTTTAAGCGTCCACAGGCGGCGGAGCGGCCACA 
ATCACAGCTCCGGGCATTGGGGGAACCCGAGCCGGCTGCGCCGGGGGAATCCGTGCGGGCGCC 
TTCCGTCCCGGTCCCATCCTCGCCGCGCTCCAGCACCTCTGAAGTTTTGCAGCGCCCAGAAAG 
GAGGCGAGGAAGGAGGGAGTGTGTGAGAGGAGGGAGCAAAAAGCTCACCCTAAAACATTTATT 
TCAAGGAGAAAAGAAAAAGGGGGGGCGCAAAAATGGCTGGGGCAATTATAGAAAACATGAGCA
CCAAGAAGCTGTGCATTGTTGGTGGGATTCTGCTCGTGTTCCAAATCATCGCCTTTCTGGTGG 
GAGGCTTGATTGCTCCAGGGCCCACAACGGCAGTGTCCTACATGTCGGTGAAATGTGTGGATG 
CCCGTAAGAACCATCACAAGACAAAATGGTTCGTGCCTTGGGGACCCAATCATTGTGACAAGA 
TCCGAGACATTGAAGAGGCAATTCCAAGGGAAATTGAAGCCAATGACATCGTGTTTTCTGTTC 
ACATTCCCCTCCCCCACATGGAGATGAGTCCTTGGTTCCAATTCATGCTGTTTATCCTGCAGC 
TGGACATTGCCTTCAAGCTAAACAACCAAATCAGAGAAAATGCAGAAGTCTCCATGGACGTTT 
CCCTGGCTTACCGTGATGACGCATTTGCTGAGTGGACTGAAATGGCCCATGAAAGAGTACCAC 
GGAAACTCAAATGCACCTTCACATCTCCCAAGACTCCAGAGCATGAGGGCCGTTACTATGAAT 
GTGATGTCCTTCCTTTCATGGAAATTGGGTCTGTGGCCCATAAGTTTTACCTTTTAAACATCC
GGCTGCCTGTGAATGAGAAGAAGAAAATCAATGTGGGAATTGGGGAGATAAAGGATATCCGGT 
TGGTGGGGATCCACCAAAATGGAGGCTTCACCAAGGTGTGGTTTGCCATGAAGACCTTCCTTA 
CGCCCAGCATCTTCATCATTATGGTGTGGTATTGGAGGAGGATCACCATGATGTCCCGACCCC 
CAGTGCTTCTGGAAAAAGTCATCTTTGCCCTTGGGATTTCCATGACCTTTATCAATATCCCAG 
TGGAATGGTTTTCCATCGGGTTTGACTGGACCTGGATGCTGCTGTTTGGTGACATCCGACAGG 
GCATCTTCTATGCGATGCTTCTGTCCTTCTGGATCATCTTCTGTGGCGAGCACATGATGGATC 
AGCACGAGCGGAACCACATCGCAGGGTATTGGAAGCAAGTCGGACCCATTGCCGTTGGCTCCT 
TCTGCCTCTTCATATTTGACATGTGTGAGAGAGGGGTACAACTCACGAATCCCTTCTACAGTA 
TCTGGACTACAGACATTGGAACAGAGCTGGCCATGGCCTTCATCATCGTGGCTGGAATCTGCC 
TCTGCCTCTACTTCCTGTTTCTATGCTTCATGGTATTTCAGGTGTTTCGGAACATCAGTGGGA 
AGCAGTCCAGCCTGCCAGCTATGAGCAAAGTCCGGCGGCTACACTATGAGGGGCTAATTTTTA 
GGTTCAAGTTCCTCATGCTTATCACCTTGGCCTGCGCTGCCATGACTGTCATCTTCTTCATCG 
TTAGTCAGGTAACGGAAGGCCATTGGAAATGGGGCGGCGTCACAGTCCAAGTGAACAGTGCCT 
TTTTCACAGGCATCTATGGGATGTGGAATCTGTATGTCTTTGCTCTGATGTTCTTGTATGCAC 
CATCCCATAAAAACTATGGAGAAGACCAGTCCAATGGCGATCTGGGTGTCCATAGTGGGGAAG 
AACTCCAGCTCACCACCACTATCACCCATGTGGACGGACCCACTGAGATCTACAAGTTGACCC 
GCAAGGAGGCCCAGGAGTAGGAGGCTGCAGCGCCCGGCTGGGACGGTCTCTCCATACCCCAGC
CCCTCTAACTAGAGTGGGGAGCATGCCAGAGAGAGCTCAATGTACAAATGAATGCCTCATGGC 
TCTTAGCTGTGGTTTCTTGGACCAGCGGCATGGACATTTGTCAGTTTGCCTTCTGACGGTAGC 
TTTTGGAGGAAGATTCCTGCAGCCACTAATGCATTGTGTATGATAACAAAAACTCTGGTATGA 
CACATTTTCTGTGATCATTGTTAATTAGTGACATAGTAACATCTGTAGCAGCTGGTTAGTAAA 
CCTCATGTGGGGGTGGGGTGGGGGTGTATTCCTTGGGGGATGGTTTGGGCCGAATGGGGAGTG 
GAATATTTGACATTTTTCCTGTTTTAAATTCTAGGATAGATTTTAACATCCTTTGCGGTCCCA 
GTCCAAGGTAGGCTGGTGTCATAGTCTTCTCACTCCTAATCCATGACCACTGTTTTTTTCCTA 
TTTATATCACCAGGTAGCCTACTGAGTTAATATTTAAGTTGTCAATAGATAAGTGTCCCTGTT
TTGTGGCATAATATAACTGAATTTCATGAGAAGATTTATTCCACCAGGGGTATTTCAGCTTTG 
AAACCAAATCTGTGTATCTAATACTAACCAATCTGTTGGATGTGGATTTTAAAAAATGTTTGC 
TAAACTAGCCAAGTAAGATTTACTGTATTAAATGGCCTTCGGGTCTGAAAAGCTTTTTTAACC 
TCTTGCTTAAAATGCGTTTTATTTTGATAAGATACTTCAAATAGCCTCCAAAAGTGTAGATCC 
AATCACTTAAATAAACCTGTATGTATATGCAAAAAAAAAAAAAAAAAAAAAAAA
FIGURE 14

MAGAIIENMSTKKLCIVGGILLVFQIIAFLVGGLIAPGPTTAVSYMSVKCVDARKNHHKTKWF 
VPWGPNHCDKIRDIEEAIPREIEANDIVFSVHIPLPHMEMSPWFQFMLFILQLDIAFKLNNQI 
RENAEVSMDVSLAYRDDAFAEWTEMAHERVPRKLKCTFTSPKTPEHEGRYYECDVLPFMEIGS 
VAHKFYLLNIRLPVNEKKKINVGIGEIKDIRLVGIHQNGGFTKVWFAMKTFLTPSIFIIMVWY 
WRRITMMSRPPVLLEKVIFALGISMTFINIPVEWFSIGFDWTWMLLFGDIRQGIFYAMLLSFW 
IIFCGEHMMDQHERNHIAGYWKQVGPIAVGSFCLFIFDMCERGVQLTNPFYSIWTTDIGTELA 
MAFIIVAGICLCLYFLFLCFMVFQVFRNISGKQSSLPAMSKVRRLHYEGLIFRFKFLMLITLA 
CAAMTVIFFIVSQVTEGHWKWGGVTVQVNSAFFTGIYGMWNLYVFALMFLYAPSHKNYGEDQS 
NGDLGVHSGEELQLTTTITHVDGPTEIYKLTRKEAQE 

Important features of the protein:
Signal peptide: 

amino acids 1-42 

Transmembrane domains:

amino acids 239-253, 269-284, 302-318, 338-352, 377-399, 434-452, 
471-488 

N-glycosylation sites.

amino acids 8-12, 406-410 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 254-258 

N-myristoylation sites. 

amino acids 223-229, 274-280, 305-311, 358-364, 374-380, 386-392, 
509-515 
FIGURE 15

GTGAGGGGAACAGCTGATCCGTCTGTTGGGAGGACAGATATCTCAAGGCCAGGATGGAAGAAT 
CACCACTAAGCCGGGCACCATCCCGTGGTGGAGTCAACTTTCTCAATGTAGCCCGGACCTACA 
TCCCCAACACCAAGGTGGAATGTCACTACACCCTTCCCCCAGGCACCATGCCCAGTGCCAGTG 
ACTGGATTGGCATCTTCAAGGTGGAGGCTGCCTGTGTTCGGGATTACCACACATTTGTGTGGT 
CTTCCGTGCCTGAAAGTACAACTGATGGTTCCCCCATTCACACCAGTGTCCAGTTCCAAGCCA 
GCTACCTGCCCAAACCAGGAGCTCAGCTCTACCAGTTCCGATATGTGAACCGCCAGGGCCAGG 
TGTGTGGGCAGAGCCCCCCTTTCCAGTTCCGAGAGCCAAGGCCCATGGATGAACTGGTGACCC 
TGGAGGAGGCTGATGGGGGCTCTGACATCCTGCTGGTTGTCCCCAAGGCAACTGTGTTACAGA 
ACCAGCTCGATGAGAGCCAGCAAGAACGGAATGACCTGATGCAGCTGAAGCTACAGCTGGAGG 
GACAGGTGACAGAGCTGAGGAGCCGAGTGCAGGAGCTCGAGAGGGCTCTGGCAACTGCCAGGC 
AGGAGCACACGGAGCTGATGGAACAGTACAAGGGGATTTCCCGGTCCCATGGGGAGATCACAG 
AAGAGAGGGACATCCTGAGCCGGCAACAGGGAGACCATGTGGCACGCATCCTGGAGCTAGAGG 
ATGACATCCAGACCATCAGTGAGAAAGTGCTGACGAAGGAAGTGGAGCTGGACAGGCTTAGAG 
ACACAGTGAAGGCCCTGACTCGGGAACAAGAGAAGCTCCTTGGGCAACTGAAAGAAGTACAAG 
CAGACAAGGAGCAAAGTGAGGCTGAGCTCCAAGTGGCACAACAGGAGAACCATCACTTAAATT 
TGGACCTGAAGGAGGCGAAGAGCTGGCAAGAGGAGCAGAGTGCTCAGGCTCAGCGACTGAAAG 
ACAAGGTGGCCCAGATGAAGGACACCCTAGGCCAGGCCCAGCAGCGGGTGGCCGAGCTGGAGC 
CCTTGAAGGAGCAGCTTCGAGGGGCCCAGGAGCTTGCAGCCTCAAGCCAGCAGAAAGCCACCC 
TTCTTGGGGAGGAGTTGGCCAGTGCAGCAGCAGCCAGGGACCGCACCATAGCCGAACTACACC 
GCAGCCGCCTGGAAGTGGCTGAAGTTAACGGCAGGCTGGCTGAGCTCGGTTTGCACTTGAAGG 
AAGAAAAATGCCAATGGAGCAAGGAGCGGGCAGGGCTGCTGCAGAGTGTGGAGGCAGAGAAGG 
ACAAGATCCTGAAGCTGAGTGCAGAGATACTTCGATTGGAGAAGGCAGTTCAGGAGGAGAGGA 
CCCAAAACCAAGTGTTCAAGACTGAGCTGGCCCGGGAGAAGGATTCTAGCCTGGTACAGTTGT 
CAGAAAGTAAGCGGGAGCTGACAGAGCTGCGGTCAGCCCTGCGTGTGCTCCAGAAGGAAAAGG 
AGCAGTTACAGGAGGAGAAACAGGAATTGCTAGAGTACATGAGAAAGCTAGAGGCCCGCCTGG 
AGAAGGTGGCAGATGAGAAGTGGAATGAGGATGCCACCACAGAGGATGAGGAGGCCGCTGTGG 
GGCTGAGCTGCCCGGCAGCTCTGACAGACTCAGAGGACGAGTCCCCAGAAGACATGAGGCTCC 
CACCCTATGGCCTTTGTGAGCGTGGAGACCCAGGCTCCTCTCCTGCTGGGCCTCGAGAGGCTT 
CTCCCCTTGTTGTCATCAGCCAGCCGGCTCCCATTTCTCCTCACCTCTCTGGGCCAGCTGAGG
ACAGTAGCTCTGACTCGGAGGCTGAAGATGAGAAGTCAGTCCTGATGGCAGCTGTGCAGAGTG 
GGGGTGAGGAGGCCAACTTACTGCTTCCTGAACTGGGCAGTGCCTTCTATGACATGGCCAGTG 
GCTTTACAGTGGGTACCCTGTCAGAAACCAGCACTGGGGGCCCTGCCACCCCCACATGGAAGG 
AGTGTCCTATCTGTAAGGAGCGCTTTCCTGCTGAGAGTGACAAGGATGCCCTGGAGGACCACA 
TGGATGGACACTTCTTTTTCAGCACCCAGGACCCCTTCACCTTTGAGTGATCTTACTCCCTCG 
TACATGCACAAATACACACTCATGCACACACACACTCACACACATGCATACACTTAGGTTTCA 
TGCCCATTTTCTATCACACTGGGCTCCATGATATTCTGTTCCCTAAGAACTGCTTCTGTGTGC 
CCTGTTTTCATCCCAAGATTTCTCACTTCATCCTCTCCTACCTGGCTCTTTTGTCCCAGGGAG 
GGGTCCTGTTCGGAAGCAGTGGCTGAATTTATCCCCTGAAAGTGGTTJTGGAGGAACCGGGAT 
GGAGGAGGCCTTCCCCTGTGGGAATAGAATCGTCCACTCCTAGCCCTGGTTGCTTCTGATACA 
CAGCCACTGCACACACACACTCACACTCACACTCCCTTGTCTGATGCCCCAAAGCCAATTCCT 
GGGGCACCCTACCCTCTCTTATTTGGAGTTTCCGTTGGTTTACCTGAGTTTTCTCTGGGGTCT 
GCACAGAGGCAGCAGCATGGACATCATGGCCTCTCAGGTCCCTTTTGGTTCTCAGTTTCATTG 
GTTCCTCTTTCTGTTCCCCCATTGACTTCTGTGCCCCACCCTAGCCTTTTCCATAACCTTAGG 
TATTCAGTTTGGAGGGGTTTTTTGTATTTTTGAGGATTCCTGTATTCTGTATCCTCTCCTCGC 
ATCTCCTCACATGGAAAGAAATAATGTATTTGTGCCTTCTGTGAGGAATGGGGGGAACAAGTG 
GTCCCAGGTATCCCCATTTCCAAGGCCCCCCTCCCTCTCCAGGTCCCCCCACAGCAATAAAAG 
CTTCCCCCTGATATCCATCCCTTTGTAGTTTGAACAAATATATTTATATGATATGTAA 
FIGURE 16

MEESPLSRAPSRGGVNFLNVARTYIPNTKVECHYTLPPGTMPSASDWIGIFKVEAACVRDYHT 
FVWSSVPESTTDGSPIHTSVQFQASYLPKPGAQLYQFRYVNRQGQVCGQSPPFQFREPRPMDE 
LVTLEEADGGSDILLVVPKATVLQNQLDESQQERNDLMQLKLQLEGQVTELRSRVQELERALA
TARQEHTELMEQYKGISRSHGEITEERDILSRQQGDHVARILELEDDIQTISEKVLTKEVELD 
RLRDTVKALTREQEKLLGQLKEVQADKEQSEAELQVAQQENHHLNLDLKEAKSWQEEQSAQAQ 
RLKDKVAQMKDTLGQAQQRVAELEPLKEQLRGAQELAASSQQKATLLGEELASAAAARDRTIA 
ELHRSRLEVAEVNGRLAELGLHLKEEKCQWSKERAGLLQSVEAEKDKILKLSAEILRLEKAVQ 
EERTQNQVFKTELAREKDSSLVQLSESKRELTELRSALRVLQKEKEQLQEEKQELLEYMRKLE 
ARLEKVADEKWNEDATTEDEEAAVGLSCPAALTDSEDESPEDMRLPPYGLCERGDPGSSPAGP 
REASPLVVISQPAPISPHLSGPAEDSSSDSEAEDEKSVLMAAVQSGGEEANLLLPELGSAFYD
MASGFTVGTLSETSTGGPATPTWKECPICKERFPAESDKDALEDHMDGHFFFSTQDPFTFE 

Important features: 

Casein kinase II phosphorylation sites: 

amino acids 28-31, 43-46, 68-71, 72-75, 129-132, 156-159, 208-211,
239-242, 282-285, 305-308, 376-379, 383-383, 468-471, 520-523,
521-524, 537-540, 539-542, 543-546, 593-596, 595-598, 597-600,
612-615, 639-642, 652-655, 667-670, 683-686

N-myristoylation sites: 

amino acids 39-44, 107-112, 204-209, 414-419, 561-566, 613-618 

Cell attachment sequence: 

amino acids 557-559 



Leucine zipper pattern sequence: 

amino acids 163-184, 475-496, 482-503 
FIGURE 17

GCAAGTTGGGAATTTTAGACTGTCACTGCACATGGACCTCTGGGAAGACGTCTGGCGAGAGCT 
AGGCCCACTGGCCCTACAGACGGATCTTGCTGGCTCACCTGTCCCTGTGGAGGTTCCCCTGGG 
AAGGCAAGATGCCCAACAACAGCACTGCTCTGTCATTGGCCAATGTTACCTACATCACCATGG 
AAATTTTCATTGGACTCTGCGCCATAGTGGGCAACGTGCTGGTCATCTGCGTGGTCAAGCTGA 
ACCCCAGCCTGCAGACCACCACCTTCTATTTCATTGTCTCTCTAGCCCTGGCTGACATTGCTG 
TTGGGGTGCTGGTCATGCCTTTGGCCATTGTTGTCAGCCTGGGCATCACAATCCACTTCTACA 
GCTGCCTTTTTATGACTTGCCTACTGCTTATCTTTACCCACGCCTCGATCATGTCCTTGCTGG 
CCATCGCTGTGGACCGATACTTGCGGGTCAAGCTTACCGTCAGATTCAGAATTCCTGGGCTCC 
CTGGGTGCATTCTATCATTCCAGTTGAAAGTTTGCTTCCTTCCAGTCATGTGGCTCTTCATTC 
TACTCTCCTTGGCTCTCATTTCAGATGCCATGGTCATGGATGAAAAGGTCAAGAGAAGCTTTG 
TGCTGGACACGGCTTCTGCCATCTGCAACTACAATGCCCACTACAAGAATCACCCCAAATACT 
GGTGCCGAGGCTATTTCCGTGACTACTGCAACATCATCGCCTTCTCCCCTAACAGCACCAATC 
ATGTGGCCCTGAGGGACACAGGGAACCAGCTCATTGTCACTATGTCCTGCCTGACCAAAGAGG 
ACACGGGCTGGTACTGGTGTGGCATCCAGCGGGACTTTGCCAGGGATGACATGGATTTTACAG 
AGCTGATTGTAACTGACGACAAAGGAACCCTGGCCAATGACTTTTGGTCTGGGAAAGACCTAT 
CAGGCAACAAAACCAGAAGCTGCAAGGCTCCCAAAGTTGTCCGCAAGGCTGACCGCTCCAGGA 
CGTCCATTCTCATCATTTGCATACTGATCACGGGTTTGGGAATCATCTCTGTAATCAGTCATT 
TGACCAAAAGGAGGAGAAGTCAAAGGAATAGAAGGGTAGGCAACACTTTGAAGCCCTTCTCGC 
GTGTCCTGACTCCAAAGGAAATGGCTCCTACTGAACAGATGTGACTGAAGATTTTTTTAATTT 
AGTTCATAAAGTGATGCTACAACAGAATAATCACCATGACAACTGGCCCACACCTCAGAGACT 
GATTCTGATCTCCCAGGAATTCTGAAGGACCCTCTATCCTTGACAACAATCATTTGCAGCCAG 
GTAGCAACGGCGGTAGTCAGAGGAGCTATGATAGACCACACCCAAGCAAGGCTGCCCTCAAAT 
AACATCTCAAGATCTTAGTTCTTATGCATTCCATCAGTCAGAAGTGAAGAAGAGGTGGAGAAT 
CTGGATTGGGGACCAGGAAATCACTTGTATTTTGTTAGCCAATAAATTCCTAGCCAGTGTTGA 
ATGAAAAAAAAAAAAA 
FIGURE 18

MPNNSTALSLANVTYITMEIFIGLCAIVGNVLVICVVKLNPSLQTTTFYFIVSLALADIAVGV 
LVMPLAIVVSLGITIHFYSCLFMTCLLLIFTHASIMSLLAIAVDRYLRVKLTVRFRIPGLPGC
ILSFQLKVCFLPVMWLFILLSLALISDAMVMDEKVKRSFVLDTASAICNYNAHYKNHPKYWCR 
GYFRDYCNIIAFSPNSTNHVALRDTGNQLIVTMSCLTKEDTGWYWCGIQRDFARDDMDFTELI 
VTDDKGTLANDFWSGKDLSGNKTRSCKAPKVVRKADRSRTSILIICILITGLGIISVISHLTK
RRRSQRNRRVGNTLKPFSRVLTPKEMAPTEQM 

Important features of the protein: 
Transmembrane domains: 

amino acids 16-35, 62-80, 89-101, 134-152, 292-311 
N-glycosylation sites. 

amino acids 3-7, 4-8, 12-16, 204-208, 273-277 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 316-320 

N-myristoylation sites. 

amino acids 122-128, 125-131, 258-264 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 214-225 

G-protein coupled receptors proteins. 

amino acids 29-59, 76-116 
FIGURE 19

CTCGGGCGCGCACAGGCAGCTCGGTTTGCCCTGCGATTGAGCTGCGGGTCGCGGCCGGCGCCGGCCTCTCCAATG 
GCAAATGTGTGTGGCTGGAGGCGAGCGCGAGGCTTTCGGCAAAGGCAGTCGAGTGTTTGCAGACCGGGGCGAGTC 
CTGTGAAAGCAGATAAAAGAAAACATTTATTAACGTGTCATTACGAGGGGAGCGCCCGGCCGGGGCTGTCGCACT 
CCCCGCGGAACATTTGGCTCCCTCCAGCTCCGAGAGAGGAGT^AGAAGAAAGCGGAAAAGAGGCAGATTCACGTCG 
TTTCCAGCCAAGTGGACCTGATCGATGGCCCTCCTGAATTTATCACGATATTTGATTTATTAGCGATGCCCCCTG 
GTTTGTGTGTTACGCACACACACGTGCACACAAGGCTCTGGCTCGCTTCCCTCCCTCGTTTCCAGCTCCTGGGCG 
AATCCCACATCTGTTTCAACTCTCCGCCGAGGGCGAGCAGGAGCGAGAGTGTGTCGAATCTGCGAGTGAAGAGGG 
ACGAGGGAAAAGAAACAAAGCCACAGACGCAACTTGAGACTCCCGCATCCCAAAAGAAGCACCAGATCAGCAAAA 
AAAGAAGATGGGCCCCCCGAGCCTCGTGCTGTGCTTGCTGTCCGCAACTGTGTTCTCCCTGCTGGGTGGAAGCTC 
GGCCTTCCTGTCGCACCACCGCCTGAAAGGCAGGTTTCAGAGGGACCGCAGGAACATCCGCCCCAACATCATCCT 
GGTGCTGACGGACGACCAGGATGTGGAGCTGGGTTCCATGCAGGTGATGAACAAGACCCGGCGCATCATGGAGCA 
GGGCGGGGCGCACTTCATCAACGCCTTCGTGACCACACCCATGTGCTGCCCCTCACGCTCCTCCATCCTCACTGG 
CAAGTACGTCCACAACCACAACACCTACACCAACAATGAGAACTGCTCCTCGCCCTCCTGGCAGGCACAGCACGA 
GAGCCGCACCTTTGCCGTGTACCTCAATAGCACTGGCTACCGGACAGCTTTCTTCGGGAAGTATCTTAATGAATA 
CAACGGCTCCTACGTGCCACCCGGCTGGAAGGAGTGGGTCGGACTCCTTAAAAACTCCCGCTTTTATAACTACAC 
GCTGTGTCGGAACGGGGTGAAAGAGAAGCACGGCTCCGACTACTCCAAGGATTACCTCACAGACCTCATCACCAA 
TGACAGCGTGAGCTTCTTCCGCACGTCCAAGAAGATGTACCCGCACAGGCCAGTCCTCATGGTCATCAGCCATGC 
AGCCCCCCACGGCCCTGAGGATTCAGCCCCACAATATTCACGCCTCTTCCCAAACGCATCTCAGCACATCACGCC 
GAGCTACAACTACGCGCCCAACCCGGACAAACACTGGATCATGCGCTACACGGGGCCCATGAAGCCCATCCACAT 
GGAATTCACCAACATGCTCCAGCGGAAGCGCTTGCAGACCCTCATGTCGGTGGACGACTCCATGGAGACGATTTA 
CAACATGCTGGTTGAGACGGGCGAGCTGGACAACACGTACATCGTATACACCGCCGACCACGGTTACCACATCGG 
CCAGTTTGGCCTGGTGAAAGGGAAATCCATGCCATATGAGTTTGACATCAGGGTCCCGTTCTACGTGAGGGGCCC 
CAACGTGGAAGCCGGCTGTCTGAATCCCCACATCGTCCTCAACATTGACCTGGCCCCCACCATCCTGGACATTGC 
AGGCCTGGACATACCTGCGGATATGGACGGGAAATCCATCCTCAAGCTGCTGGACACGGAGCGGCCGGTGAATCG 
GTTTCACTTGAAAAAGAAGATGAGGGTCTGGCGGGACTCCTTCTTGGTGGAGAGAGGCAAGCTGCTACACAAGAG 
AGACAATGACAAGGTGGACGCCCAGGAGGAGAACTTTCTGCCCAAGTACCAGCGTGTGAAGGACCTGTGTCAGCG 
TGCTGAGTACCAGACGGCGTGTGAGCAGCTGGGACAGAAGTGGCAGTGTGTGGAGGACGCCACGGGGAAGCTGAA 
GCTGCATAAGTGCAAGGGCCCCATGCGGCTGGGCGGCAGCAGAGCCCTCTCCAACCTCGTGCCCAAGTACTACGG 
GCAGGGCAGCGAGGCCTGCACCTGTGACAGCGGGGACTACAAGCTCAGCCTGGCCGGACGCCGGAAAAAACTCTT 
CAAGAAGAAGTACAAGGCCAGCTATGTCCGCAGTCGCTCCATCCGCTCAGTGGCCATCGAGGTGGACGGCAGGGT 
GTACCACGTAGGCCTGGGTGATGCCGCCCAGCCCCGAAACCTCACCAAGCGGCACTGGCCAGGGGCCCCTGAGGA 
CCAAGATGACAAGGATGGTGGGGACTTCAGTGGCACTGGAGGCCTTCCCGACTACTCAGCCGCCAACCCCATTAA 
AGTGACACATCGGTGCTACATCCTAGAGAACGACACAGTCCAGTGTGACCTGGACCTGTACAAGTCCCTGCAGGC 
CTGGAAAGACCACAAGCTGCACATCGACCACGAGATTGAAACCCTGCAGAACAAAATTAAGAACCTGAGGGAAGT
CCGAGGTCACCTGAAGAAAAAGCGGCCAGAAGAATGTGACTGTCACAAAATCAGCTACCACACCCAGCACAAAGG 
CCGCCTCAAGCACAGAGGCTCCAGTCTGCATCCTTTCAGGAAGGGCCTGCAAGAGAAGGACAAGGTGTGGCTGTT
GCGGGAGCAGAAGCGCAAGAAGAAACTCCGCAAGCTGCTCAAGCGCCTGCAGAACAACGACACGTGCAGCATGCC 
AGGCCTCACGTGCTTCACCCACGACAACCAGCACTGGCAGACGGCGCCTTTCTGGACACTGGGGCCTTTCTGTGC 
CTGCACCAGCGCCAACAATAACACGTACTGGTGCATGAGGACCATCAATGAGACTCACAATTTCCTCTTCTGTGA 
ATTTGCAACTGGCTTCCTAGAGTACTTTGATCTCAACACAGACCCCTACCAGCTGATGAATGCAGTGAACACACT 
GGACAGGGATGTCCTCAACCAGCTACACGTACAGCTCATGGAGCTGAGGAGCTGCAAGGGTTACAAGCAGTGTAA 
CCCCCGGACTCGAAACATGGACCTGGATGGAGGAAGCTATGAGCAATACAGGCAGTTTCAGCGTCGAAAGTGGCC 
AGAAATGAAGAGACCTTCTTCCAAATCACTGGGACAACTGTGGGAAGGCTGGGAAGGTTAAGAAACAACAGAGGT 
GGACCTCCAAAAACATAGAGGCATCACCTGACTGCACAGGCAATGAAAAACCATGTGGGTGATTTCCAGCAGACC 
TGTGCTATTGGCCAGGAGGCCTGAGAAAGCAAGCACGCACTCTCAGTCAACATGACAGATTCTGGAGGATAACCA 
GCAGGAGCAGAGATAACTTCAGGAAGTCCATTTTTGCCCCTGCTTTTGCTTTGGATTATACCTCACCAGCTGCAC 
AAAATGCATTTTTTCGTATCAAAAAGTCACCACTAACCCTCCCCCAGAAGCTCACAAAGGAAAACGGAGAGAGCG 
AGCGAGAGAGATTTCCTTGGAAATTTCTCCCAAGGGCGAAAGTCATTGGAATTTTTAAATCATAGGGGAAAAGCA 
GTCCTGTTCTAAATCCTCTTATTCTTTTGGTTTGTCACAAAGAAGGAACTAAGAAGCAGGACAGAGGCAACGTGG 
AGAGGCTGAAAACAGTGCAGAGACGTTTGACAATGAGTCAGTAGCACAAAAGAGATGACATTTACCTAGCACTAT 
AAACCCTGGTTGCCTCTGAAGAAACTGCCTTCATTGTATATATGTGACTATTTACATGTAATCAACATGGGAACT 
TTTAGGGGAACCTAATAAGAAATCCCAATTTTCAGGAGTGGTGGTGTCAATAAACGCTCTGTGGCCAGTGTAAAA 
GAAAAA 
FIGURE 20

MGPPSLVLCLLSATVFSLLGGSSAFLSHHRLKGRFQRDRRNIRPNIILVLTDDQDVELGSMQV 
MNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGKYVHNHNTYTNNENCSSPSWQAQHESRT 
FAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGSDYSK 
DYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHITPSYNY 
APNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVDDSMETIYNMLVETGELDNTYIVYT 
ADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCLNPHIVLNIDLAPTILDIAGLDIP 
ADMDGKSILKLLDTERPVNRFHLKKKMRVWRDSFLVERGKLLHKRDNDKVDAQEENFLPKYQR 
VKDLCQRAEYQTACEQLGQKWQCVEDATGKLKLHKCKGPMRLGGSRALSNLVPKYYGQGSEAC 
TCDSGDYKLSLAGRRKKLFKKKYKASYVRSRSIRSVAIEVDGRVYHVGLGDAAQPRNLTKRHW 
PGAPEDQDDKDGGDFSGTGGLPDYSAANPIKVTHRCYILENDTVQCDLDLYKSLQAWKDHKLH 
IDHEIETLQNKIKNLREVRGHLKKKRPEECDCHKISYHTQHKGRLKHRGSSLHPFRKGLQEKD 
KVWLLREQKRKKKLRKLLKRLQNNDTCSMPGLTCFTHDNQHWQTAPFWTLGPFCACTSANNNT 
YWCMRTINETHNFLFCEFATGFLEYFDLNTDPYQLMNAVNTLDRDVLNQLHVQLMELRSCKGY 
KQCNPRTRNMDLDGGSYEQYRQFQRRKWPEMKRPSSKSLGQLWEGWEG 

Important features: 
Signal peptide: 

amino acids 1-17 

Sulfatases signature 1.

amino acids 86-99 

Homologous region to sulfatase: 

amino acids 87-106, 133-146, 216-229, 291-320, 365-375 
N-glycosylation sites. 

amino acids 65-69, 112-116, 132-136, 149-153, 171-175, 198-202, 
241-245, 561-565, 608-612, 717-721, 754-758, 764-768 
FIGURE 21

GGGCGCGCGAGAGCTGCTAGGGCGGTTTCTCTGCCTCGGGCCTGTTGGGCAGGGCCGGCT 
AAGGTGCGCGTGCTCGCTGGTTCTAACCCTTCTGTTGGGCGTTTCTGCTGAGAGGCGGGA 
GGCGCTGAGAGTCTGTGCGGAGGTCCGTGGACAGACTGCTTTGCTCGTTGTTGCTCTTCG 
GAGGCGGCGATCCCCGAAGGCGAGCTGAAATACGGCTGCAGGCTACAATTTGCAGCCGAC 
GATTATGGAAGACGGAAGCGGGAGAGGTGGCCCACCCTCATGGAGCGCTTGTGCTCGGAT 
GGCTTCGCATTTCCCCAATACCCCATTAAACCGTATCATCTGAAGAGGATCCACAGAGCT 
GTCTTACATGGTAATCTAGAGAAACTGAAGTACCTTCTGCTCACGTATTATGACGCCAAT 
AAGAGAGACAGGAAGGAAAGGACCGCCCTACATTTGGCCTGTGCCACTGGCCAACCGGAA 
ATGGTACATCTCCTGGTGTCCAGAAGATGTGAGCTTAACCTCTGCGACCGTGAAGACAGG 
ACACCTCTGATCAAGGCTGTACAACTGAGGCAGGAGGCTTGTGCAACTCTTCTGCTGCAA 
AATGGCGCCAATCCAAATATTACGGATTTCTTTGGAAGGACTGCTCTGCACTACGCTGTG 
TATAATGAAGATACATCCATGATAGAAAAACTTCTTTCACATGGTACAAATATTGAAGAA 
TGCAGCAAGGTATAGGTCAACCAATGTTATTTTCAAACTATCTGAAATGAATTTATTTTA 
ACATTGACACATGTAAGGGTCAATTTTTCATATTTGGAAGCTCAAACATTCCTTGAATGA 
AAATATTTTGAAATGCCTTAACTGTCTAAGATTTTACTTTAAATATTGGAACTTTTAAAG 
AAGCATTATAGGGAACAGCCTTTTTTCATGCACTTATGGTAAATAACTATAAAAACAAAT 
GAATTACAATAAATTTATAATTCATGACAACTGAATTTGGGAAAGGTAATAGTTAAGTGT 
TTTTCCACTAAATTACTTTTT 
FIGURE 22

MERLCSDGFAFPQYPIKPYHLKRIHRAVLHGNLEKLKYLLLTYYDANKRDRKERTALHLACAT 
GQPEMVHLLVSRRCELNLCDREDRTPLIKAVQLRQEACATLLLQNGANPNITDFFGRTALHYA 
VYNEDTSMIEKLLSHGTNIEECSKV 

Important features of the protein: 
N-glycosylation site. 

amino acids 113-117 

N-myristoylation site. 

amino acids 109-115 

Microbodies C-terminal targeting signal.

amino acids 149-153 
FIGURE 23

GAGGCAGAAAGGCAGAAAGGAGAAAATTCAGGATAACTCTCCTGAGGGGTGAGCCAAGCCCTG 
CCATGTAGTGCACGCAGGACATCAACAAACACAGATAACAGGAAATGATCCATTCCCTGTGGT 
CACTTATTCTAAAGGCCCCAACCTTCAAAGTTCAAGTAGTGATATGGATGACTCCACAGAAAG 
GGAGCAGTCACGCCTTACTTCTTGCCTTAAGAAAAGAGAAGAAATGAAACTGAAGGAGTGTGT 
TTCCATCCTCCCACGGAAGGAAAGCCCCTCTGTCCGATCCTCCAAAGACGGAAAGCTGCTGGC 
TGCAACCTTGCTGCTGGCACTGCTGTCTTGCTGCCTCACGGTGGTGTCTTTCTACCAGGTGGC 
CGCCCTGCAAGGGGACCTGGCCAGCCTCCGGGCAGAGCTGCAGGGCCACCACGCGGAGAAGCT 
GCCAGCAGGAGCAGGAGCCCCCAAGGCCGGCCTGGAGGAAGCTCCAGCTGTCACCGCGGGACT 
GAAAATCTTTGAACCACCAGCTCCAGGAGAAGGCAACTCCAGTCAGAACAGCAGAAATAAGCG
TGCCGTTCAGGGTCCAGAAGAAACAGTCACTCAAGACTGCTTGCAACTGATTGCAGACAGTGA 
AACACCAACTATACAAAAAGGATCTTACACATTTGTTCCATGGCTTCTCAGCTTTAAAAGGGG
AAGTGCCCTAGAAGAAAAAGAGAATAAAATATTGGTCAAAGAAACTGGTTACTTTTTTATATA
TGGTCAGGTTTTATATACTGATAAGACCTACGCCATGGGACATCTAATTCAGAGGAAGAAGGT 
CCATGTCTTTGGGGATGAATTGAGTCTGGTGACTTTGTTTCGATGTATTCAAAATATGCCTGA 
AACACTACCCAATAATTCCTGCTATTCAGCTGGCATTGCAAAACTGGAAGAAGGAGATGAACT 
CCAACTTGCAATACCAAGAGAAAATGCACAAATATCACTGGATGGAGATGTCACATTTTTTGG 
TGCATTGAAACTGCTGTGACCTACTTACACCATGTCTGTAGCTATTTTCCTCCCTTTCTCTGT 
ACCTCTAAGAAGAAAGAATCTAACTGAAAATACCAAAAAAAAAAAAAAAA 
FIGURE 24

MDDSTEREQSRLTSCLKKREEMKLKECVSILPRKESPSVRSSKDGKLLAATLLLALLSCCLTV 
VSFYQVAALQGDLASLRAELQGHHAEKLPAGAGAPKAGLEEAPAVTAGLKIFEPPAPGEGNSS 
QNSRNKRAVQGPEETVTQDCLQLIADSETPTIQKGSYTFVPWLLSFKRGSALEEKENKILVKE 
TGYFFIYGQVLYTDKTYAMGHLIQRKKVHVFGDELSLVTLFRCIQNMPETLPNNSCYSAGIAK 
LEEGDELQLAIPRENAQISLDGDVTFFGALKLL 

Transmembrane domain: 

amino acids 47-72 

N-glycosylation site. 

amino acids 124-127, 242-245 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36, 173-176 

N-myristoylation site. 

amino acids 96-101 

TNF family proteins. 

amino acids 172-206 
FIGURE 25

CTGCTTGGATACCTCCAGTCCCCAAACTGTGTTCCAGGAGTTTTCTTGGCCGAAGCTGCCCGA 
TGTTTGAGCCTTTTCTTCCCAGAGAAGAAGATGGACTGAAAGCTGCCAGTTGGGGACTTTTTG 
TGATCACGGCGTTGCAGCGTTTTAAAGGAGGTGATGGGGCTTGCGCTGGCTTGTCTTCCCACC 
CAAGTGAAGAGTTGATGTTCACTGGTTATGCTTAGACAATGTGCAGTTTGTGTTAATTTAAAA 
TTTTGGGTGGGATAGGGGCATAGGCTTGTGAAGGGCAGTCCGGATCCGGAGGAACTCGTCTTT 
GTCCCTGGTAGGAGAGACACCCCCAGTCTATCCTCGATGCCGTCAGCCTTGGCCATCTTCACT 
TGCCGCCCGAACTCGCACCCGTTTCAGGAGCGTCATGTCTACCTGGACGAGCCCATCAAAATC 
GGCCGCTCAGTGGCCCGCTGTCGACCAGCGCAGAATAATGCCACTTTTGATTGCAAAGTGCTA 
TCAAGGAACCACGCTCTCGTCTGGTTTGATCACAAGACGGGCAAGTTTTATCTTCAAGACACT 
AAAAGTAGTAATGGTACTTTTATAAATAGCCAGAGATTGAGTCGAGGCTCTGAAGAAAGTCCA 
CCATGTGAAATTCTTTCCGGTGACATTATCCAGTTTGGAGTAGACGTGACAGAGAATACACGG 
AAAGTTACCCATGGGTGTATTGTTTCCACAATAAAACTTTTTCTACCAGATGGTATGGAAGCC 
CGGCTCCGCTCAGATGTCATCCATGCACCATTACCAAGTCCTGTTGACAAAGTTGCTGCTAAC 
ACTCCAAGTATGTACTCTCAGGAACTATTCCAGCTTTCTCAGTATCTACAGGAGGCCTTACAT 
CGGGAACAAATGTTGGAACAGAAGTTAGCCACGCTTCAGCGGCTACTAGCCATCACCCAAGAG 
GCTTCAGATACCAGTTGGCAGGCTTTAATAGATGAAGATAGACTCTTATCACGGTTAGAAGTT 
ATGGGAAACCAATTACAGGCATGCTCCAAAAATCAAACAGAAGATAGTTTACGAAAGGAACTT 
ATAGCATTACAAGAGGATAAACATAACTATGAGACAACAGCCAAAGAGTCCCTGAGGCGGGTT 
CTTCAGGAGAAAATTGAAGTGGTTAGAAAACTTTCAGAAGTTGAGCGAAGTCTGAGTAATACT
GAAGATGAATGTACCCATCTGAAAGAAATGAATGAAAGGACTCAGGAAGAATTAAGAGAATTA 
GCCAACAAATATAATGGAGCAGTTAATGAGATTAAAGATTTATCTGATAAATTAAAGGTAGCA 
GAGGGAAAACAAGAGGAAATCCAACAGAAGGGACAGGCTGAGAAAAAAGAATTACAACATAAA 
ATAGATGAAATGGAAGAAAAAGAACAGGAGCTCCAGGCAAAAATAGAAGCTTTGCAAGCTGAT 
AATGATTTCACCAATGAAAGGCTAACAGCTTTACAAGTACGGTTAGAACATCTTCAGGAGAAA 
ACTCTTAAAGAATGCAGCAGCTTGGCTGATCGTCGAAGGGCATCTAACCAAAGCGGTAGAAGA 
AACAAAGCTTTCAAAAGGTTTGTTTTCTGTTTTTCTATGTTTTTTGACAGTTCTTTTGGATAA 
TGAAGGTTAGTGTATATTTTCAAGGTTATAGTATTTTAACCATCAGTTTACTTCTTATAGCTC 
ACAAAATAGCAAGCCAGTAACAGTATCAGATAATATATAAAATAATCAGACTTCTGTTTTAAG 
AAGGGTATCGTAACTGGAATGTGTCTTTTTAAGTGGATGTATATTTATGGTTTTTTGAATGTT 
AGTACTTGATATAGGTTTCTTTAGGTATTAAAGATTTGTTGCAATCTCTGTCATTCCCAGCAT 
TAATTTCAGCTTTGATCTCAAATTTTAATCAAACACAATGTAAGTCGTTTGTGATACAACTTA 
AGTGAAACATGCTTGCACTTCTATTTTGGGGGTTACAGTACCTTTAAAATCTCTTATGATGTT 
TAATATTTCCTTAATTTTTGGCATCTCAGTTTGATTTAAACAAAATTAATGACTTTTGTGAAT 
GTAGAATCTTCTTATATTTTATGAGTAGTCCAGTAATTGCCCAAAGTAGTTTATTGTGTTAAT 
TCTGTTACAGTTGTCAGAGAAGAAAAGTGAGTTTTAAAGCACCATATTGTCAAGTCACTTTTA 
TACATAGGGAAATTAGGCAAATAAATTTGGTGGCATGTGTTTATCATAGTAGAACTTTCATTA 
GACTATACCAGTATAAAATTTAAAACTAGATTCACAGTCCTTTTGGCCAATTAAAACATTGAG 
TTACAAAAGTTTGAGATACTTAATTTTAGTACATTCTATTTTATTAAAGTAACTGGATTCATT 
TGACTTTTTTAACCATGTAAGAGGATGGTGTTATTTCAAATATCTCGTGGTTTCCATTCTGAA 
TTTTGTGCACGGCAGATGCCATATTTGGGGAAAAAATGCATAGAATATGCATCATTAATATTG 
TTTTGGCAAACAGGCATTGAGTTTCAGAACAGTGAACTATTTTTAGTACATATGGCAATTTTT 
TTCACCTTATTAAAGTGAGATGAGAACAGACCTTAAAATAGCTTTTACCTCACCATCCAAATA 
CCTATTCAGATTAGTTGGTTGAATAGCCAGCACTTTGAAGTAGAGCCTTAGG 
FIGURE 26

MEARLRSDVIHAPLPSPVDKVAANTPSMYSQELFQLSQYLQEALHREQMLEQKLATLQRLLAI 
TQEASDTSWQALIDEDRLLSRLEVMGNQLQACSKNQTEDSLRKELIALQEDKHNYETTAKESL 
RRVLQEKIEVVRKLSEVERSLSNTEDECTHLKEMNERTQEELRELANKYNGAVNEIKDLSDKL 
KVAEGKQEEIQQKGQAEKKELQHKIDEMEEKEQELQAKIEALQADNDFTNERLTALQVRLEHL 
QEKTLKECSSLADRRRASNQSGRRNKAFKRFVFCFSMFFDSSFG 

Important features of the protein: 
N-glycosylation sites.

amino acids 98-102, 271-275 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 138-142, 267-271 

Amidation site. 

amino acids 273-277 

Tropomyosins proteins. 

amino acids 169-217 
FIGURE 27

GAACCTGGCGCCGCCGGAACTGATCGCGGCCTAGTCCCGACGCGTGTGTGCTAGTGAGCCGGA 
GCCGGCGACGGCGGCAGTGGCGGCCCGGCCTGCAGGAGCCCGACGGGGTCTCTGCCATGGGGG 
AGTGACGCGCCTGCACCCGCTGTTCCGCGGCAGCGGCGAGACATGAGGAGACCCCGCGACAGG 
GGCAGCGGCGGCGGCTCGTGAGCCCCGGGATGGAGGAGAAATACGGCGGGGACGTGCTGGCCG
GCCCCGGCGGCGGCGGCGGCCTTGGGCCGGTGGACGTACCCAGCGCTCGATTAACAAAATATA 
TTGTGTTACTATGTTTCACTAAATTTTTGAAGGCTGTGGGACTTTTCGAATCATATGATCTCC 
TAAAAGCTGTTCACATTGTTCAGTTCATTTTTATATTAAAACTTGGGACTGCATTTTTTATGG 
TTTTGTTTCAAAAGCCATTTTCTTCTGGGAAAACTATTACCAAACACCAGTGGATCAAAATAT
TTAAACATGCAGTTGCTGGGTGTATTATTTCACTCTTGTGGTTTTTTGGCCTCACTCTTTGTG 
GACCACTAAGGACTTTGCTGCTATTTGAGCACAGTGATATTGTTGTCATTTCACTACTCAGTG 
TTTTGTTCACCAGTTCTGGAGGAGGACCAGCAAAGACAAGGGGAGCTGCTTTTTTCATTATTG 
CTGTGATCTGTTTATTGCTTTTTGACAATGATGATCTCATGGCTAAAATGGCTGAACACCCTG
AAGGACATCATGACAGTGCTCTAACTCATATGCTTTACACAGCCATTGCCTTCTTAGGTGTGG 
CAGATCACAAGGGTGGAGTATTATTGCTAGTACTGGCTTTGTGTTGTAAAGTTGGTTTTCATA 
CAGCTTCCAGAAAGCTCTCTGTCGACGTTGGTGGAGCTAAACGTCTTCAAGCTTTATCTCATC 
TTGTTTCTGTGCTTCTCTTGTGCCCATGGGTCATTGTTCTTTCTGTGACAACTGAGAGTAAAG 
TGGAGTCTTGGTTTTCTCTCATTATGCCTTTTGCAACGGTTATCTTTTTTGTCATGATCCTGG 
ATTTCTACGTGGATTCCATTTGTTCAGTCAAAATGGAAGTTTCCAAATGTGCTCGTTATGGAT 
CCTTTCCCATTTTTATTAGTGCTCTCCTTTTTGGAAATTTTTGGACACATCCAATAACAGACC 
AGCTTCGGGCTATGAACAAAGCAGCACACCAGGAGAGCACTGAACACGTCCTGTCTGGAGGAG 
TGGTAGTGAGTGCTATATTCTTCATTTTGTCTGCCAATATCTTATCATCTCCCTCTAAGAGAG 
GACAAAAAGGTACCCTTATTGGATATTCTCCTGAAGGAACACCTCTTTATAACTTCATGGGTG 
ATGCTTTTCAGCATAGCTCTCAATCGATCCCTAGGTTTATTAAGGAATCACTAAAACAAATTC 
TTGAGGAGAGTGACTCTAGGCAGATCTTTTACTTCTTGTGCTTGAATCTGCTTTTTACCTTTG 
TGGAATTATTCTATGGCGTGCTGACCAATAGTCTGGGCCTGATCTCGGATGGATTCCACATGC 
TTTTTGACTGCTCTGCTTTAGTCATGGGACTTTTTGCTGCCCTGATGAGTAGGTGGAAAGCCA 
CTCGGATTTTCTCCTATGGGTACGGCCGAATAGAAATTCTGTCTGGATTTATTAATGGACTTT 
TTCTAATAGTAATAGCGTTTTTTGTGTTTATGGAGTCAGTGGCTAGATTGATTGATCCTCCAG 
AATTAGACACTCACATGTTAACACCAGTCTCAGTTGGAGGGCTGATAGTAAACCTTATTGGTA 
TCTGTGCCTTTAGCCATGCCCATAGCCATGCCCATGGAGCTTCTCAAGGAAGCTGTCACTCAT 
CTGATCACAGCCATTCACACCATATGCATGGACACAGTGACCATGGGCATGGTCACAGCCACG 
GATCTGCGGGTGGAGGCATGAATGCTAACATGAGGGGTGTATTTCTACATGTTTTGGCAGATA 
CACTTGGCAGCATTGGTGTGATCGTATCCACAGTTCTTATAGAGCAGTTTGGATGGTTCATCG 
CTGACCCACTCTGTTCTCTTTCTACTGCTATATTAATATTTCTCAGTGTTGTTCCACTGATTA 
AAGATGCCTGCCAGGTTCTACTCCTGAGATTGCCACCAGAATATGAAAAAGAACTACATATTG 
CTTTAGAAAAGATACAGAAAATTGAAGGATTAATATCATACCGAGACCCTCATTTTTGGCGTC 
ATTCTGCTAGTATTGTGGCAGGAACAATTCATATACAGGTGACATCTGATGTGCTAGAACAAA 
GAATAGTACAGCAGGTTACAGGAATACTTAAAGATGCTGGAGTAAACAATTTAACAATTCAAG 
TGGAAAAGGAGGCATACTTTCAACATATGTCTGGCCTAAGTACTGGATTTCATGATGTTCTGG 
CTATGACAAAACAAATGGAATCCATGAAATACTGCAAAGATGGTACTTACATCATGTGAGATA 
ACTCAAGAATTACCCCTGGAGAATAAACAATGAAGATTAAATGACTCAGTATTTGTAATATTG 
CCAGAAGGATAAAAATTACACATTAACTGTACAGAAACAGAGTTCCCTACTACTGGATCAAGG 
AATCTTTCTTGAAGGAAATTTAAATACAGAATGAAACATTAATGGTAAAAAAAA 
FIGURE 28

MEEKYGGDVLAGPGGGGGLGPVDVPSARLTKYIVLLCFTKFLKAVGLFESYDLLKAVHIVQFI 
FILKLGTAFFMVLFQKPFSSGKTITKHQWIKIFKHAVAGCIISLLWFFGLTLCGPLRTLLLFE 
HSDIVVISLLSVLFTSSGGGPAKTRGAAFFIIAVICLLLFDNDDLMAKMAEHPEGHHDSALTH 
MLYTAIAFLGVADHKGGVLLLVLALCCKVGFHTASRKLSVDVGGAKRLQALSHLVSVLLLCPW 
VIVLSVTTESKVESWFSLIMPFATVIFFVMILDFYVDSICSVKMEVSKCARYGSFPIFISALL 
FGNFWTHPITDQLRAMNKAAHQESTEHVLSGGVVVSAIFFILSANILSSPSKRGQKGTLIGYS
PEGTPLYNFMGDAFQHSSQSIPRFIKESLKQILEESDSRQIFYFLCLNLLFTFVELFYGVLTN 
SLGLISDGFHMLFDCSALVMGLFAALMSRWKATRIFSYGYGRIEILSGFINGLFLIVIAFFVF 
MESVARLIDPPELDTHMLTPVSVGGLIVNLIGICAFSHAHSHAHGASQGSCHSSDHSHSHHMH 
GHSDHGHGHSHGSAGGGMNANMRGVFLHVLADTLGSIGVIVSTVLIEQFGWFIADPLCSLSTA 
ILIFLSVVPLIKDACQVLLLRLPPEYEKELHIALEKIQKIEGLISYRDPHFWRHSASIVAGTI 
HIQVTSDVLEQRIVQQVTGILKDAGVNNLTIQVEKEAYFQHMSGLSTGFHDVLAMTKQMESMK 
YCKDGTYIM 

Important features of the protein: 
Signal peptide: 

amino acids 1-46

Transmembrane domains:

amino acids 59-77, 101-119, 150-167, 205-223, 239-258, 267-284, 
305-324, 343-360, 421-440, 452-469, 486-505, 522-539, 592-612, 
621-641 

N-glycosylation site. 

amino acids 721-725

Glycosaminoglycan attachment site.

amino acids 143-147 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 225-229 

Tyrosine kinase phosphorylation sites. 

amino acids 750-758, 756-764

N-myristoylation sites.

amino acids 14-20, 46-52, 102-108, 112-118, 144-150, 317-323,
347-353, 369-375, 372-378, 437-443, 462-468, 529-535, 549-555,
553-559, 579-585, 582-588, 583-589, 584-590, 605-611, 737-743

Multicopper oxidases protein:

amino acids 561-569
FIGURE 29

GGCACGAGGGCAGGATATTAGAAATGGCTACTCCCCAGTCAATTTTCATCTTTGCAATCTGCA 
TTTTAATGATAACAGAATTAATTCTGGCCTCAAAAAGCTACTATGATATCTTAGGTGTGCCAA 
AATCGGCATCAGAGCGCCAAATCAAGAAGGCCTTTCACAAGTTGGCCATGAAGTACCACCCTG 
ACAAAAATAAGAGCCCGGATGCTGAAGCAAAATTCAGAGAGATTGCAGAAGCATATGAAACAC 
TCTCAGATGCTAATAGACGAAAAGAGTATGATACACTTGGACACAGTGCTTTTACTAGTGGTA 
AAGGACAAAGAGGTAGTGGAAGTTCTTTTGAGCAGTCATTTAACTTCAATTTTGATGACTTAT 
TTAAAGACTTTGGCTTTTTTGGTCAAAACCAAAACACTGGATCCAAGAAGCGTTTTGAAAATC 
ATTTCCAGACACGCCAGGATGGTGGTTCCAGTAGACAAAGGCATCATTTCCAAGAATTTTCTT 
TTGGAGGTGGATTATTTGATGACATGTTTGAAGATATGGAGAAAATGTTTTCTTTTAGTGGTT 
TTGACTCTACCAATCAGCATACAGTACAGACTGAAAATAGATTTCATGGATCTAGCAAGCACT 
GCAGGACTGTCACTCAACGAAGAGGAAATATGGTTACTACATACACTGACTGTTCAGGACAGT 
AGTTCTTATTCTATTCTCACTAAATCCAACTGGTTGACTCTTCCTCATTATCTTTGATGCTAA 
ACAATTTTCTGTGAACTATTTTGACAAGTGCATGATTTCACTTTAAACAATTTGATATAGCTA
TTAAATATATTTAAGGGTTTTTTTTTTTGACAAATTCAACATTCAACGAGTAGACAAAATGCT 
AATTATTTCCCTGATTAGGAAAGTTTCTTTAAAAAACACGTAATTTTGCCTAGTGCTTTTTCT 
CTACCTGCCCTTGGGCTCACTAATATCACCAGTATTATTACCAAGAAAATATTGAGTTTACCT 
GATTAAACTTTAAAAGTTAATTGTAGATTTAAATTGTGTGAACCTAATGATTTTTGCAGTGAA 
ACCTTTACTAATTCAAAGTTGCATGTTCTATGACATCTGTGACTTGCGTTGCAGAGTGTACAT 
GAAACTGTATAATTGAGTCATTCAGTAAAGGAGAACAGTATCTTGGTTAATTGCTACTGAAAG 
GTTGAGAAAGGAATGGTTTGATATTTACCACAGCGCTGTGCCTTTCTACAGTAGAACTGGGGT 
AAAGGAAATGGTTTTATTGCCCATAGTCATTTAGGCTGGAAAAAAGTTGAAAACTTAACGAAA 
TATTGCCAAGAGATTGTTATGTGTTTGGTTCCAGCCTAAAAATGATTTTGTAGTGTTGAAATC 
ATAGCTACTTACATAGCTTTTTCATATTTCTTTCTTAGTTGTTGGCACTCTTAGGTCTTAGTA 
TGGATTTATGTGTTTGTGTGTGTGTAGTTTATCCTCTCTCTCATCTTTATCTAGAGATTGACT
GATACCTCATTCTGTTTGTAAAACCAGCCAGTAATTTCTGTGCAACCTTACTATGTGCAATAT 
TTTTAAATCCTGAGAAATGTGTGCTTTTGTTTTCGGATAGACTTATTTCTTTAGTTCTGCACT 
TTTCCACATTATACTCCATATGAGTATTAATCCTATGGATACATATTAAAACAAGTGTCTCAT 
FIGURE 30

MATPQSIFIFAICILMITELILASKSYYDILGVPKSASERQIKKAFHKLAMKYHPDKNKSPDA 
EAKFREIAEAYETLSDANRRKEYDTLGHSAFTSGKGQRGSGSSFEQSFNFNFDDLFKDFGFFG 
QNQNTGSKKRFENHFQTRQDGGSSRQRHHFQEFSFGGGLFDDMFEDMEKMFSFSGFDSTNQHT 
VQTENRFHGSSKHCRTVTQRRGNMVTTYTDCSGQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

Nt-dnaJ domain signature. 

amino acids 27-59, 66-90 

Glycosaminoglycan attachment site. 

amino acids 96-100 

N-myristoylation sites.

amino acids 32-38, 99-105, 102-108, 126-132, 211-217 
FIGURE 31

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTGG 
GCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACAAT 
TCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCTGAG 
ATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCAAATG 
CAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCTACGCA 
TTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTCTCTGTA 
CTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGAAACAGTG 
TACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCTGGATCCCC 
AGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATCACGGCCACT 
GTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTGGAGCATCCTG 
AAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGATCACCAAAGAT 
GGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTGCTTGTGGCCTAC 
TGGAGGAGGGAGCCTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGGGGGTATTGCAGTG 
CACCTAGAAACCATGGAGCCAGGGGCTGCATACTGTGTGAAGGCCCAGACATTCGTGAAGGCC 
ATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAAGGAGAGGCCATTCCC 
CTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGTGGTCGTGCCACTGTTC 
GTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGGTGGTCCTCCCAGACACC 
TTGAAAATAACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGGGAGGAGGTGGATGCCTGT 
GCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGATCTCATAGGTTTGCGGAAGG
GCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACCATGAGGGGACAAGTTGTGTT 
TCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGAGCCTGTTGTCTACAAGTCTAG 
AAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACACTGACTGAGGCTTAGGGGATGTG 
ACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACCCTGGGAAAAGTGACTTCATCCCT 
TCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACCTACACACCTGCTAAACACACACAC 
ACAGAGTCTCTCTCTATATATACACACGTACACATAAATACACCCAGCACTTGCAAGGCTAGA 
GGGAAACTGGTGACACTCTACAGTCTGACTGATTCAGTGTTTCTGGAGAGCAGGACATAAATG 
TATGATGAGAATGATCAAGGACTCTACACACTGGGTGGCTTGGAGAGCCCACTTTCCCAGAAT 
AATCCTTGAGAGAAAAGGAATCATGGGAGCAATGGTGTTGAGTTCACTTCAAGCCCAATGCCG 
GTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGTAGGTGACCTGGAGGAAGGTCACAGCCACA 
CTGAAAATGGGATGTGCATGAACACGGAGGATCCATGAACTACTGTAAAGTGTTGACAGTGTG 
TGCACACTGCAGACAGCAGGTGAAATGTATGTGTGCAATGCGACGAGAATGCAGAAGTCAGTA 
ACATGTGCATGTTTGTTGTGCTCCTTTTTTCTGTTGGTAAAGTACAGAATTCAGCAAATAAAA 
AGGGCCACCCTGGCCAAAAGCGGTAAAAAAAAAAAAAAAA 
FIGURE 32

MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPGET 
VYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAWSI 
LKHPFNRNSTILTRPGMEITKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHVKMVRSGGIP
VHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAIPLVLALFAFVGFMLILVVVPL
FVWKMGRLLQYSCCPVVVLPDTLKITNSPQKLISCRREEVDACATAVMSPEELLRAWIS 

Important features: 
Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation sites.

amino acids 40-44, 134-138 

Tissue factor proteins.

amino acids 92-120 

Integrins alpha chain proteins. 

amino acids 232-263 
FIGURE 33

GAGACACGCGAGCGGGGAGACCTCCAAGGCAGCGAGGCATCGGACATGTGTCAGCACATCTGG 
GGCGCACATCCGTCGAGCCCGAGGGGAGATTTGCCGGAACAATTCAAACTGCGATATTGATCT 
TGGGGGTGACTGTCCCTGGCCGGCTGTCGGGTGGGAGTGCGAGTGTGCACTCGCTCGGAAGTG 
TGTGCGAGTGTGTATGTGTGTGTGCCGTGTCGGGCTCCCCCCTTCCCCCCGTTTTCCCGTCGA 
GTGATGCACTTGGAATGAGAATCAGAGGATGGAAATAGTCTGGGAGGTGCTTTTTCTTCTTCA
AGCCAATTTCATCGTCTGCATATCAGCTCAACAGAATTCACCAAAAATCCATGAAGGCTGGTG 
GGCATACAAGGAGGTGGTCCAGGGAAGCTTTGTTCCAGTTCCTTCTTTCTGGGGATTGGTGAA 
CTCAGCTTGGAATCTTTGCTCTGTGGGGAAACGGCAGTCGCCAGTCAACATAGAGACCAGTCA 
CATGATCTTCGACCCCTTTCTGACACCTCTTCGCATCAACACGGGGGGCAGGAAGGTCAGTGG
GACCATGTACAACACTGGAAGACACGTATCCCTTCGCCTGGACAAGGAGCACTTGGTCAACAT 
ATCTGGAGGGCCCATGACATACAGCCACCGGCTGGAGGAGATCCGACTACACTTTGGGAGTGA 
GGACAGCCAAGGGTCGGAGCACCTCCTCAATGGACAGGCCTTCTCTGGGGAGGTGCAGCTCAT 
CCACTATAACCATGAGCTATATACGAATGTCACAGAAGCTGCAAAGAGTCCAAATGGATTGGT
GGTAGTTTCTATATTTATAAAAGTTTCTGATTCATCAAACCCATTTCTTAATCGAATGCTCAA 
CAGAGATACTATCACAAGAATAACATATAAAAATGATGCATATTTACTACAGGGGCTTAATAT 
AGAGGAACTATATCCAGAGACCTCTAGTTTCATCACTTACGATGGGTCGATGACTATCCCACC 
CTGCTATGAGACAGCAAGTTGGATCATAATGAACAAACCTGTCTATATAACCAGGATGCAGAT 
GCATTCCTTGCGCCTGCTCAGCCAGAACCAGCCATCTCAGATCTTTCTGAGCATGAGTGACAA 
CTTCAGGCCTGTCCAGCCACTCAACAACCGCTGCATCCGCACCAATATCAACTTCAGTTTACA 
GGGGAAGGACTGTCCAAACAACCGAGCCCAGAAGCTTCAGTATAGAGTAAATGAATGGCTCCT
CAAGTAGGGAACAAAGCCAAGAAGAATCCCACCTCAGTGAAATGCTACAACTGTGAATTGACG
TAACCTAGAATGTCCCCCTTCTTGCTTCTCTCTCCTTCTTTCCCCCAAGCCTCATTCATTCTT 
GGGATTGGCCCTTTCTTCATGAAAAGTGTCTGCGAAACCATGGCAGAGGAATACATCTCTCAC 
ACATACTCACAAACACACACACAAGCACTTGCACATACATACAAACACATGCAAACATACCTA 
CACACACACACTCTCTTACAACCTCCATCATGGGAAGTCAAGTTTCAGAAACAAAAGTCTCAT 
TCATAAGAGGTCTTAGAAGAAAATAACCAGTTAACCTGATTTCAATTTTGATACCGTTTTCCT 
GAACTAATAAATCTACCCAATGAGACTTTTCAGCCTTTGTACATACAAAATTCTTCCAAAAGA 
GAGAGGAGAAAATACAGCTCTGATGGCATCAAACGGACTTTGCATCAAGTAATTTCAGATAGT
GTCCTAGGATCCTTTGAGGGTGCTGGTAGCAGGTGAGCAGGACAAAGTTGACCAAGGACACTT 
ATTTCTAGATTATGATTCTTCTGTTTACTCAACAATTTACAAAGAAAAAAAGGACAGACATTG
AAGAGCTACACATTGTATATATATCACCACAGACTATAAGGAAATGGAATTATTTCCCTCTTT 
GTCACATATCTGTAGTAGGATTTGCCAAGATCAGAAATGATCCATTTGCTGTTTCTTGTTTTC 
CAAAGGTCATACATTGTGTTTGGTTATTGTTACCAGCTCAATAAATGTGTTTAACGAGTTAAT 
TTCATTTTTCTGGCTTTGGTCTGTTCTCCTTCCTTACAGGCTAAGCCCTGGCTCCATGCAACT 
GCATTCTTTGATTTCACTTGTTCCTTCATCTACATGTTTTGTTCATTTGCAGCCAGTTTTTAC 
TGAGTTTGTGGCAATCAGGAATGCATTTGCTAAGCAAGTATGACTTTAATTCCACTCCATGGC 
TCAATCATTCACATGAGGTGAGCTTCAGCCTGAGATAGCAGGCGACAGACTTCTTGCGTTTCA 
AAACTGCCATGCCCCCCTGTGATGCTCCCGTGAAGGAATGCACTTTGCCTTGTAAGTTCCTGG 
GAAAGGGGTATGTTTTCTCTCCAGGTGCAGCCAGATCTCACAAAGTACAAAACGAATGCCTTT 
CTTTTCTTGTTTATAATGGTCACTCACTGTGTTTGGTTACTGTCAAGAAATCAATAAATGTGT 
TTAACAAGTTA 
FIGURE 34

MEIVWEVLFLLQANFIVCISAQQNSPKIHEGWWAYKEVVQGSFVPVPSFWGLVNSAWNLCSVG 
KRQSPVNIETSHMIFDPFLTPLRINTGGRKVSGTMYNTGRHVSLRLDKEHLVNISGGPMTYSH 
RLEEIRLHFGSEDSQGSEHLLNGQAFSGEVQLIHYNHELYTNVTEAAKSPNGLVVVSIFIKVS
DSSNPFLNRMLNRDTITRITYKNDAYLLQGLNIEELYPETSSFITYDGSMTIPPCYETASWII 
MNKPVYITRMQMHSLRLLSQNQPSQIFLSMSDNFRPVQPLNNRCIRTNINFSLQGKDCPNNRA 
QKLQYRVNEWLLK 

Important features: 
Signal peptide: 

amino acids 1-20 

Eukaryotic-type carbonic anhydrases proteins.

amino acids 126-162, 220-269, 43-91 

N-glycosylation sites. 

amino acids 116-119, 168-171, 302-305 
FIGURE 35

GTCGGAACCCCCTCAGGCCACCCTCGGGAGTCCTGGGGTCCAGAGGGGTGTCCCTGTACCCCTTGCAC 
ACAGGACCCTCACTCTGCAGGGATAAGCCAGCTGCGCCTGCAGCCTAGGGTGCCAAGGAGGCTGCTGA 
TTGTGGCCCACAGCCTCATCTGAACGCCAGGAGACCAGGATACCGAGGCACCGGATCCCCTCTCTGTG 
CCCTGGGGAGCCCCAGTGCTGCCCAGTCACCCCAGGGCTGAGGTCTGCGTCCCTAGTGGTGCAAGGCC 
TGGTAGGACCACGGGGCAGGGAATGTGAGCGCCATCCGAGCTCACGGTGTCCTGAGTCGCGGCTTCGT 
GACTTTGGCAGGGGCCTCCGGACCAGTGACCCCAGTCAAACCCAGAGGGTCTTGGGCGGCAGCGACGA 
AGGAGGTATTCAGGCTCCAGGCCAGGTGGGGCCGGACGCCCCCAGCCATCCACCATGGTGGTGGCACA 
CCCCACCGCCACTGCCACCACCACGCCCACTGCCACTGTCACGGCCACCGTTGTGATGACCACGGCCA 
CCATGGACCTGCGGGACTGGCTGTTCCTCTGCTACGGGCTCATCGCCTTCCTGACGGAGGTCATCGAC 
AGCACCACCTGCCCCTCGGTGTGCCGCTGCGACAACGGCTTCATCTACTGCAACGACCGGGGACTCAC 
ATCCATCCCCGCAGATATCCCTGATGACGCCACCACCCTCTACCTGCAGAACAACCAGATCAACAACG 
CCGGCATCCCCCAGGACCTCAAGACCAAGGTCAACGTGCAGGTCATCTACCTATACGAGAATGACCTG 
GATGAGTTCCCCATCAACCTGCCCCGCTCCCTCCGGGAGCTGCACCTGCAGGACAACAATGTGCGCAC 
CATTGCCAGGGACTCGCTGGCCCGCATCCCGCTGCTGGAGAAGCTGCACCTGGATGACAACTCCGTGT 
CCACCGTCAGCATTGAGGAGGACGCCTTCGCCGACAGCAAACAGCTCAAGCTGCTCTTCCTGAGCCGG 
AACCACCTGAGCAGCATCCCCTCGGGGCTGCCGCACACGCTGGAGGAGCTGCGGCTGGATGACAACCG 
CATCTCCACCATCCCGCTGCATGCCTTCAAGGGCCTCAACAGCCTGCGGCGCCTGGTGCTGGACGGTA 
ACCTGCTGGCCAACCAGCGCATCGCCGACGACACCTTCAGCCGCCTACAGAACCTCACAGAGCTCTCG 
CTGGTGCGCAATTCGCTGGCCGCGCCACCCCTCAACCTGCCCAGCGCCCACCTGCAGAAGCTCTACCT 
GCAGGACAATGCCATCAGCCACATCCCCTACAACACGCTGGCCAAGATGCGTGAGCTGGAGCGGCTGG
ACCTGTCCAACAACAACCTGACCACGCTGCCCCGCGGCCTGTTCGACGACCTGGGGAACCTGGCCCAG 
CTGCTGCTCAGGAACAACCCTTGGTTTTGTGGCTGCAACCTCATGTGGCTGCGGGACTGGGTGAAGGC
ACGGGCGGCCGTGGTCAACGTGCGGGGCCTCATGTGCCAGGGCCCTGAGAAGGTCCGGGGCATGGCCA 
TCAAGGACATTACCAGCGAGATGGACGAGTGTTTTGAGACGGGGCCGCAGGGCGGCGTGGCCAATGCG 
GCTGCCAAGACCACGGCCAGCAACCACGCCTCTGCCACCACGCCCCAGGGTTCCCTGTTTACCCTCAA 
GGCCAAAAGGCCAGGGCTGCGCCTCCCCGACTCCAACATTGACTACCCCATGGCCACGGGTGATGGCG 
CCAAGACCCTGGCCATCCACGTGAAGGCCCTGACGGCAGACTCCATCCGCATCACGTGGAAGGCCACG 
CTCCCCGCCTCCTCTTTCCGGCTCAGTTGGCTGCGCCTGGGCCACAGCCCAGCCGTGGGCTCCATCAC 
GGAGACCTTGGTGCAGGGGGACAAGACAGAGTACCTGCTGACAGCCCTGGAGCCCAAGTCCACCTACA 
TCATCTGCATGGTCACCATGGAGACCAGCAATGCCTATGTAGCTGATGAGACACCCGTGTGTGCCAAG 
GCAGAGACAGCCGACAGCTATGGCCCTACCACCACACTCAACCAGGAGCAGAACGCTGGCCCCATGGC 
GAGCCTGCCCCTGGCGGGCATCATCGGCGGGGCAGTGGCTCTGGTCTTCCTCTTCCTGGTCCTGGGGG 
CCATCTGCTGGTACGTGCACCAGGCTGGCGAGCTGCTGACCCGGGAGAGGGCCTACAACCGGGGCAGC 
AGGAAAAAGGATGACTATATGGAGTCAGGGACCAAGAAGGATAACTCCATCCTGGAAATCCGCGGCCC
TGGGCTGCAGATGCTGCCCATCAACCCGTACCGCGCCAAAGAGGAGTACGTGGTCCACACTATCTTCC 
CCTCCAACGGCAGCAGCCTCTGCAAGGCCACACACACCATTGGCTACGGCACCACGCGGGGCTACCGG
GACGGCGGCATCCCCGACATAGACTACTCCTACACATGATGCCCGCCCACCCGGGCTGCCCCGCCTCA 
GCCCCAGCTGCCCTGGCGTGGCCATGTGGCTTTGCCCAGCCTGCTGCAATCCAAGAGAGCAAGGAAGA 
GAAATTCCATGGGTGACTTTCCTCCGCAGAAAGCAAAGTTTGGGGAGGGCTGACGATTTTGTAGAACA 
CAACAGTGACAATTTTTTTTAAAAGAATAGAAGGCAGGAGGGGGAATTCGACATTGTTGAAGACATAA
TTTATACCAAGTTATGCCAGTTGGGGAGGGAAGGACTAAAAATAATATTGCAGGCAGGGCTGGGTTGG
GTTTTTTTTTTTCCCCCCTGAACTGGAAGGATACTACCTGTACAACATCTGTGGACACCTCATGCTCT 
GTTCAAGGCCATCACAAAGGAACCGCCAGGGAGAAGCAGCCGGCTCTCAAAGCTCCCACGCAGCTCTC 
CCGCCACTGGCCACTCGCTGGCGACCCGATGGAAGGTTTTCAGGCTCCTCACAAAGGAGAGAGGGAAG 
AAAAGATCTTTTGCCCTGGAGATATGGTCCTGAAATCTCTCCCCTGGCTTATTCCATACCATTTCCCT 
TGCAGATTTGCAGAAACATGGCATCTTTCACTGCATTCTTTGAACAATCATGTAGTCGATTAAAAAAA 
AAAACAAACTTTTTTTTCCTAGGCTGAAGCCCTCTTCAGTTCCATGCACCACGCTCCGTAGAAGCCCC
GGCGGAAGCCGTAGCTTTCCCTGCCACCTGGAGGTGCATCTGTCTGCCTGTCTATCCCTGTCGCGGTG 
TCTCTAAGTACAGATGGGTAGATAGAGCCACATGCACGGTCCTTACCGTTCTTCTTGGGTCAGTTCTT 
ACCATTTCCTGAACAATAGAATTGTGAAAGTGTTAAAAA 
FIGURE 36

MVVAHPTATATTTPTATVTATVVMTTATMDLRDWLFLCYGLIAFLTEVIDSTTCPSVCRCDNG
FIYCNDRGLTSIPADIPDDATTLYLQNNQINNAGIPQDLKTKVNVQVIYLYENDLDEFPINLP
RSLRELHLQDNNVRTIARDSLARIPLLEKLHLDDNSVSTVSIEEDAFADSKQLKLLFLSRNHL 
SSIPSGLPHTLEELRLDDNRISTIPLHAFKGLNSLRRLVLDGNLLANQRIADDTFSRLQNLTE 
LSLVRNSLAAPPLNLPSAHLQKLYLQDNAISHIPYNTLAKMRELERLDLSNNNLTTLPRGLFD 
DLGNLAQLLLRNNPWFCGCNLMWLRDWVKARAAVVNVRGLMCQGPEKVRGMAIKDITSEMDEC 
FETGPQGGVANAAAKTTASNHASATTPQGSLFTLKAKRPGLRLPDSNIDYPMATGDGAKTLAI 
HVKALTADSIRITWKATLPASSFRLSWLRLGHSPAVGSITETLVQGDKTEYLLTALEPKSTYI 
ICMVTMETSNAYVADETPVCAKAETADSYGPTTTLNQEQNAGPMASLPLAGIIGGAVALVFLF 
LVLGAICWYVHQAGELLTRERAYNRGSRKKDDYMESGTKKDNSILEIRGPGLQMLPINPYRAK 
EEYVVHTIFPSNGSSLCKATHTIGYGTTRGYRDGGIPDIDYSYT

Important features of the protein: 
Transmembrane domain: 

amino acids 552-573 

N-glycosylation sites. 

amino acids 249-252, 305-308, 642-645 

Leucine zipper pattern.

amino acids 182-203, 299-320 

Phospholipase A2 aspartic acid active site. 

amino acids 57-67 
FIGURE 37

GGTGACTGAAGCGAGCCTGGCCTCTTGCATCCTCCGCCTGTGTACCTCCCTCCCCTTTTTTTCCGCCT 

TCTGCCAGCAGAAGCAGCAGCCGCAGCACCTGAGCCGCTACTGCCGCTCACTCAGGACAACGCTATGG 

CTGAGCCTGGGCACAGCCACCATCTCTCCGCCAGAGTCAGGAGAAGAACTGAGAGGCGCATACCCCGG 

CTGTGGCGGCTGCTGCTCTGGGCTGGGACCGCCTTCCAGGTGACCCAGGGAACGGGACCGGAGCTTCA 

TGCCTGCAAAGAGTCTGAGTACCACTATGAGTACACGGCGTGTGACAGCACGGGTTCCAGGTGGAGGG 

TCGCCGTGCCGCATACCCCGGGCCTGTGCACCAGCCTGTCTGACCCCGTCAAGGGCACCGAGTGCTCC 

TTCTCCTGCAACGCCGGGGAGTTTCTGGATATGAAGGACCAGTCATGTAAGCCATGCGCTGAGGGCCG 

CTACTCCCTCGGCACAGGCATTCGGTTTGATGAGTGGGATGAGCTGCCCCATGGCTTTGCCAGCCTCT 

CAGCCAACATGGAGCTGGATGACAGTGCTGCTGAGTCCACCGGGAACTGTACTTCGTCCAAGTGGGTT 

CCCCGGGGCGACTACATCGCCTCCAACACGGACGAATGCACAGCCACACTGATGTACGCCGTCAACCT 

GAAGCAATCTGGCACCGTTAACTTCGAATACTACTATCCAGACTCCAGCATCATCTTTGAGTTTTTCG 

TTCAGAATGACCAGTGCCAGCCCAATGCAGATGACTCCAGGTGGATGAAGACCACAGAGAAAGGATGG 

GAATTCCACAGTGTGGAGCTAAATCGAGGCAATAATGTCCTCTATTGGAGAACCACAGCCTTCTCAGT 

ATGGACCAAAGTACCCAAGCCTGTGCTGGTGAGAAACATTGCCATAACAGGGGTGGCCTACACTTCAG 

AATGCTTCCCCTGCAAACCTGGCACGTATGCAGACAAGCAGGGCTCCTCTTTCTGCAAACTTTGCCCA 

GCCAACTCTTATTCAAATAAAGGAGAAACTTCTTGCCACCAGTGTGACCCTGACAAATACTCAGAGAA 

AGGATCTTCTTCCTGTAACGTGCGCCCAGCTTGCACAGACAAAGATTATTTCTACACACACACGGCCT 

GCGATGCCAACGGAGAGACACAACTCATGTACAAATGGGCCAAGCCGAAAATCTGTAGCGAGGACCTT 

GAGGGGGCAGTGAAGCTGCCTGCCTCTGGTGTGAAGACCCACTGCCCACCCTGCAACCCAGGCTTCTT 

CAAAACCAACAACAGCACCTGCCAGCCCTGCCCATATGGTTCCTACTCCAATGGCTCAGACTGTACCC 

GCTGCCCTGCAGGGACTGAACCTGCTGTGGGATTTGAATACAAATGGTGGAACACGCTGCCCACAAAC 

ATGGAAACGACCGTTCTCAGTGGGATCAACTTCGAGTACAAGGGCATGACAGGCTGGGAGGTGGCTGG

TGATCACATTTACACAGCTGCTGGAGCCTCAGACAATGACTTCATGATTCTCACTCTGGTTGTGCCAG 

GATTTAGACCTCCGCAGTCGGTGATGGCAGACACAGAGAATAAAGAGGTGGCCAGAATCACATTTGTC 

TTTGAGACCCTCTGTTCTGTGAACTGTGAGCTCTACTTCATGGTGGGTGTGAATTCTAGGACCAACAC 

TCCTGTGGAGACGTGGAAAGGTTCCAAAGGCAAACAGTCCTATACCTACATCATTGAGGAGAACACTA 

CCACGAGCTTCACCTGGGCCTTCCAGAGGACCACTTTTCATGAGGCAAGCAGGAAGTACACCAATGAC 

GTTGCCAAGATCTACTCCATCAATGTCACCAATGTTATGAATGGCGTGGCCTCCTACTGCCGTCCCTG 

TGCCCTAGAAGCCTCTGATGTGGGCTCCTCCTGCACCTCTTGTCCTGCTGGTTACTATATTGACCGAG 

ATTCAGGAACCTGCCACTCCTGCCCCCCTAACACAATTCTGAAAGCCCACCAGCCTTATGGTGTCCAG 

GCCTGTGTGCCCTGTGGTCCAGGGACCAAGAACAACAAGATCCACTCTCTGTGCTACAATGATTGCAC 

CTTCTCACGCAACACTCCAACCAGGACTTTCAACTACAACTTCTCCGCTTTGGCAAACACCGTCACTC 

TTGCTGGAGGGCCAAGCTTCACTTCCAAAGGGTTGAAATACTTCCATCACTTTACCCTCAGTCTCTGT 

GGAAACCAGGGTAGGAAAATGTCTGTGTGCACCGACAATGTCACTGACCTCCGGATTCCTGAGGGTGA 

GTCAGGGTTCTCCAAATCTATCACAGCCTACGTCTGCCAGGCAGTCATCATCCCCCCAGAGGTGACAG 

GCTACAAGGCCGGGGTTTCCTCACAGCCTGTCAGCCTTGCTGATCGACTTATTGGGGTGACAACAGAT 

ATGACTCTGGATGGAATCACCTCCCCAGCTGAACTTTTCCACCTGGAGTCCTTGGGAATACCGGACGT 

GATCTTCTTTTATAGGTCCAATGATGTGACCCAGTCCTGCAGTTCTGGGAGATCAACCACCATCCGCG 

TCAGGTGCAGTCCACAGAAAACTGTCCCTGGAAGTTTGCTGCTGCCAGGAACGTGCTCAGATGGGACC 

TGTGATGGCTGCAACTTCCACTTCCTGTGGGAGAGCGCGGCTGCTTGCCCGCTCTGCTCAGTGGCTGA

CTACCATGCTATCGTCAGCAGCTGTGTGGCTGGGATCCAGANGACTACTTACGTGTGNCGAGAACCCA 

AGCTATGCTCTGGTGGCATTTCTCTGCCTGAGCAGAGAGTCACCATCTGCAAAACCATAGATTTCTGG 

CTGAAAGTGGGCATCTCTGCAGGCACCTGTACTGCCATCCTGCTCACCGTCTTGACCTGCTACTTTTG 

GAAAAAGAATCAAAAACTAGAGTACAAGTACTCCAAGCTGGTGATGAATGCTACTCTCAAGGACTGTG 

ACCTGCCAGCAGCTGACAGCTGCGCCATCATGGAAGGCGAGGATGTAGAGGACGACCTCATCTTTACC 

AGCAAGAAGTCACTTTTTGGGAAGATCAAATCATTTACCTCCAAGAGGACTCCTGATGGATTTGACTC 

AGTGCCGCTGAAGACATCCTCAGGAGGCCCAGACATGGACCTGTGAGAGGCACTGCCTGCCTCACCTG 

CCTCCTCACCTTGCATAGCACCTTTGCAAGCCTGCGGCGATTTGGGTGCCAGCATCCTGCAACACCCA 

CTGCTGGAAATCTCTTCATTGTGGCCTTATCAGATGTTTGAATTTCAGATCTTTTTTTATAGAGTACC 

CAAACCCTCCTTTCTGCTTGCCTCAAACCTGCCAAATATACCCACATTTTTTTTTAAAAAAAAAAAAA

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 38

MAEPGHSHHLSARVRRRTERRIPRLWRLLLWAGTAFQVTQGTGPELHACKESEYHYEYTACDS 
TGSRWRVAVPHTPGLCTSLSDPVKGTECSFSCNAGEFLDMKDQSCKPCAEGRYSLGTGIRFDE 
WDELPHGFASLSANMELDDSAAESTGNCTSSKWVPRGDYIASNTDECTATLMYAVNLKQSGTV 
NFEYYYPDSSIIFEFFVQNDQCQPNADDSRWMKTTEKGWEFHSVELNRGNNVLYWRTTAFSVW 
TKVPKPVLVRNIAITGVAYTSECFPCKPGTYADKQGSSFCKLCPANSYSNKGETSCHQCDPDK 
YSEKGSSSCNVRPACTDKDYFYTHTACDANGETQLMYKWAKPKICSEDLEGAVKLPASGVKTH 
CPPCNPGFFKTNNSTCQPCPYGSYSNGSDCTRCPAGTEPAVGFEYKWWNTLPTNMETTVLSGI 
NFEYKGMTGWEVAGDHIYTAAGASDNDFMILTLVVPGFRPPQSVMADTENKEVARITFVFETL 
CSVNCELYFMVGVNSRTNTPVETWKGSKGKQSYTYIIEENTTTSFTWAFQRTTFHEASRKYTN 
DVAKIYSINVTNVMNGVASYCRPCALEASDVGSSCTSCPAGYYIDRDSGTCHSCPPNTILKAH 
QPYGVQACVPCGPGTKNNKIHSLCYNDCTFSRNTPTRTFNYNFSALANTVTLAGGPSFTSKGL 
KYFHHFTLSLCGNQGRKMSVCTDNVTDLRIPEGESGFSKSITAYVCQAVIIPPEVTGYKAGVS 
SQPVSLADRLIGVTTDMTLDGITSPAELFHLESLGIPDVIFFYRSNDVTQSCSSGRSTTIRVR 
CSPQKTVPGSLLLPGTCSDGTCDGCNFHFLWESAAACPLCSVADYHAIVSSCVAGIQXTTYVX 
REPKLCSGGISLPEQRVTICKTIDFWLKVGISAGTCTAILLTVLTCYFWKKNQKLEYKYSKLV 
MNATLKDCDLPAADSCAIMEGEDVEDDLIFTSKKSLFGKIKSFTSKRTPDGFDSVPLKTSSGG 
PDMDL 



Important features of the protein: 
N-glycosylation sites:

amino acids 153-156, 390-393, 391-394, 404-407, 544-547, 576-579,
672-675, 717-720, 947-950

cAMP- and cGMP-dependent protein kinase phosphorylation sites:

amino acids 15-18, 563-566, 709-712

Casein kinase II phosphorylation sites:

amino acids 42-45, 59-62, 81-84, 146-149, 168-171, 282-285, 331-334,
340-343, 431-434, 449-452, 465-468, 523-526, 557-560, 761-764,
780-783, 835-838, 860-863, 893-896, 949-952

Tyrosine kinase phosphorylation sites:

amino acids 50-56, 109-116

N-myristoylation sites:

amino acids 77-82, 88-93, 152-157, 268-273, 288-293, 320-325,
400-405, 405-410, 414-419, 463-468, 599-604, 616-621, 634-639,
644-649, 839-844, 874-879, 912-917, 916-921

Amidation site: 

amino acids 707-710 

Cell attachment sequence: 

amino acids 162-164 
FIGURE 39

GGGAAGGGGTTCTGGGCTGCCGCAGGCACACAGGCCAGAGCTTCGTGGATACCTGCAGGGCCC 
AAAGGTCCCTCCCTGTTTTGAAGAGTGAGTGATGGCTATGAGGTAGCGGCCAGGCTGATCACC 
CCTGCGTTGGCTGGAGGCAGAATTCTGTAAATCCTCGCCAAGTCTTTCTCCAGGCCACTGGTT 
AGCTCATCTCAGCCTCCTCTGGGAGCATCAACACCAACATGGCACAGGGGACTGCAGTGGTGT 
GCTTTGGACCTGTGTACCCACCCAAGGCTAAAGGCAGAGCCAGGTGACTTTGCGGGGGTCTCT 
TCTCTAGGATTATCTGTACTTCCCCTCTGTCCTCTTTTACTACGGGAGATCGAGCTAGCTATA 
ACCCACCTTCTTTCATGAGAACCACACTAAATTGCAAAAATTATCCCAGTGCTGGAGGAGGGC 
AGCAGGTTGAGATTATGTTGGCAGGAAGAATGTTGGCATTGATTGGCACGCAGGGGACGAGAG 
CTGCTTTGTGCTTTAAAGGAGCCAAGTTACACCCTGTTTAACCCTGCCTTCAAAGGGACGACT 
CTGTAAGATTCTCTGCTACTTATTCAAGTTGACACGATGCCCTTCACACTCCACCTGAGGTCC 
CGCCTTCCCTCTGCCATAAGGAGTTTGATTCTACAAAAGAAACCAAACATCAGAAATACATCC 
AGCATGGCTGGAGAGCTCCGACCAGCCAGCCTGGTGGTCCTGCCCAGGTCCCTTGCTCCAGCT 
TTTGAAAGATTCTGCCAGGTCAACACTGGTCGTCTACCGCTGCTGGGCCAGAGTGAGCCAGAA 
AAGTGGATGCTGCCCCCTCAAGGTGCTATCTCAGAGACCAGGATGGGCCATCCCCAGTTCTGG 
AAATACGAGTTCGGTGCCTGCACCGGTAGCCTGGCTTCGCTGGAGCAGTACTCGGAGCAGCTG 
AAGGACATGGTGGCCTTCTTCCTGGGCTGCAGCTTCTCCCTGGAGGAGGCCTTGGAGAAAGCG 
GGGCTCCCCAGAAGAGACCCAGCAGGTCACAGCCAGGCGGGTGCATACAAGACAACAGTGCCT 
TGTGTTACCCATGCTGGCTTCTGCTGCCCTCTGGTGGTCACGATGAGGCCCATTCCCAAGGAC 
AAGCTGGAAGGGCTGGTGGGGGCCTGCTGCTCCCTCGGAGGTGAGCAGGGGCAACCTGTTCAC 
ATGGGCGACCCAGAACTGTTGGGAATCAAAGAGCTTTCCAAACCTGCCTACGGGGATGCCATG 
GTGTGTCCCCCAGGGGAGGTTCCAGTGTTCTGGCCTTCTCCGCTGACCAGTCTCGGAGCTGTC 
AGCAGCTGTGAGACCCCACTGGCTTTTGCCAGCATCCCAGGCTGCACAGTTATGACTGACCTG 
AAGGATGCAAAGGCTCCACCTGGTTGTCTCACCCCAGAGAGAATTCCAGAGGTCCATCACATT 
TCCCAAGATCCTCTGCACTACAGCATCGCGTCAGTCTCTGCTTCTCAGAAGATCAGAGAACTA 
GAGTCTATGATCGGCATAGACCCAGGGAACCGGGGGATTGGGCACCTGCTCTGTAAAGATGAG 
CTGCTGAAGGCCTCTCTCTCGCTGTCCCATGCCCGCTCAGTGCTCATCACCACTGGGTTCCCC
ACACATTTCAATCATGAGCCTCCAGAAGAGACAGATGGCCCACCAGGAGCTGTTGCTCTGGTT 
GCCTTCCTGCAGGCCTTGGAGAAGGAGGTCGCCATAATCGTTGACCAGAGAGCCTGGAACTTG 
CACCAGAAGATTGTTGAAGATGCTGTTGAGCAAGGTGTTCTGAAGACGCAGATCCCGATATTA 
ACTTACCAAGGTGGATCAGTGGAAGCTGCTCAGGCATTCCTGTGCAAAAATGGGGACCCGCAG 
ACACCTAGATTTGACCACCTGGTGGCCATAGAGCGTGCCGGAAGAGCTGCTGATGGCAATTAC 
TACAATGCAAGGAAGATGAACATCAAGCACTTGGTTGACCCCATTGACGATCTTTTTCTTGCT 
GCGAAGAAGATTCCTGGAATCTCATCAACTGGAGTCGGTGATGGAGGCAACGAGCTTGGGATG 
GGTAAAGTCAAGGAGGCTGTGAGGAGGCACATACGGCACGGGGATGTCATCGCCTGCGACGTG 
GAGGCTGACTTTGCCGTCATTGCTGGTGTTTCTAACTGGGGAGGCTATGCCCTGGCCTGCGCA 
CTCTACATCCTGTACTCATGTGCTGTCCACAGTCAGTACCTGAGGAAAGCAGTCGGACCCTCC 
AGGGCACCTGGAGATCAGGCCTGGACTCAGGCCCTCCCGTCGGTCATTAAGGAAGAAAAAATG 
CTGGGCATCTTGGTGCAGCACAAAGTCCGGAGTGGGGTCTCGGGCATCGTGGGCATGGAGGTG 
GATGGGCTGCCCTTCCACAACACCCACGCCGAGATGATCCAGAAGCTGGTGGACGTCACCACG 
GCACAGGTGTAACCGTCCATGTTCCGTGTGAGCAGAGTCCCTACCAACGGGCAGGTCTGCATC
CGGGGAGAATGCAGCTGCTTCTGGCGACAATCCTGCTAGTAAACACTGGTCTTCGGTGAGCAA 
CGAACACTCGCCTGGCCTGGGAAACTGCATGCCCACTTTCTGGGAGGGGTTAGTGCAGGTGCC 
GTGGACAAAGGACAACATTTCTCTGGGGCTTTTTAACTTTTATTCCTAAGACTCTAAAGGCGT 
TGATTTCAACCCTCCTTCACTCTGGCTTCTTCAGGCAACCCACGTGGTCTCCTATGAGAATCT 
TCTCGACAGTTACTTATGGGGACACTTGTGAACAATTAACTGCCAGGGCAGAGCATGAGAACA 
AACATTCCCAGGCCATGTAGGATAGGATACTCCAGACTCCAGTCATCCTCCCCCATCCATGGT 
TTCTGTTACTCATGGTTTCAGTTACTCATAGCCAACTGCAGACCGAAAATACTAAATGAAAAA 
TTTCAGAAATAAACAACTCTTAAGTTTTAAAAAAAAA

FIGURE 40

MPFTLHLRSRLPSAIRSLILQKKPNIRNTSSMAGELRPASLVVLPRSLAPAFERFCQVNTGPL 
PLLGQSEPEKWMLPPQGAISETRMGHPQFWKYEFGACTGSLASLEQYSEQLKDMVAFFLGCSF 
SLEEALEKAGLPRRDPAGHSQAGAYKTTVPCVTHAGFCCPLVVTMRPIPKDKLEGLVRACCSL
GGEQGQPVHMGDPELLGIKELSKPAYGDAMVCPPGEVPVFWPSPLTSLGAVSSCETPLAFASI 
PGCTVMTDLKDAKAPPGCLTPERIPEVHHISQDPLHYSIASVSASQKIRELESMIGIDPGNRG 
IGHLLCKDELLKASLSLSHARSVLITTGFPTHFNHEPPEETDGPPGAVALVAFLQALEKEVAI 
IVDQRAWNLHQKIVEDAVEQGVLKTQIPILTYQGGSVEAAQAFLCKNGDPQTPRFDHLVAIER 
AGRAADGNYYNARKMNIKHLVDPIDDLFLAAKKIPGISSTGVGDGGNELGMGKVKEAVRRHIR 
HGDVIACDVEADFAVIAGVSNWGGYALACALYILYSCAVHSQYLRKAVGPSRAPGDQAWTQAL 
PSVIKEEKMLGILVQHKVRSGVSGIVGMEVDGLPFHNTHAEMIQKLVDVTTAQV 

Signal peptide: 

amino acids 1-17 



Transmembrane domain: 

amino acids 358-378, 517-539 



N-glycosylation site.

amino acids 28-32 



Tyrosine kinase phosphorylation site. 

amino acids 444-452 

N-myristoylation site. 

amino acids 98-104, 102-108, 123-129, 149-155, 181-187, 190-196, 
238-244, 308-314, 399-405, 413-419, 448-454, 477-483, 482-488, 
487-493 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 233-244, 531-542

FIGURE 41

CTTTCCTGTTTTATCCGCAGCCCTTTTCTTCTTTGAGTTAGTAAAGATTTATTCTGTAACCTG 
ACACTCATCTGGCCCTTTGCAGTTTGCCAGCCATATTCCCATGTGATTTCCCACTGGATCCAG 
GCCCCCATCCGGCTGGCAGGAGGGGGCTCTGACGTACAGGTTGGAAATCAGAAGTCTGTGAGA 
GCGCGGGAGTGCATGGCAGCTCTGGGTCCCAGACCTGGCCCGACCCCTCTGCTTCACCTGCAG 
CTCTGCTGCTCCTCTACTCTTGGGTCGAGATCCCTTTGGAGCCACAGCGAGGAACCCTGTGGT 
CCTCAGGCAGGTGTACCTTGAGTCAGCCAGGAGCCCTCTTTTCCTGTGTCAAAGCCTGCCCTC 
GGGCTCTGCTCACCTCTGGTGACCCTCCAAGATGCCCCTGCCCTCAGTTTCCCCTCATGATCT 
GGCCTCTGCCCCCTTCTCTAGCCACAGCCTCTAGTACACTTTAGCAATACCACCAGACTAGTT 
AGAGTTCCCCACTCACCAAGCAAGACATGCAGTTTCATGCCTCTGTGCCTTCGCTCATGCTGT 
TTCTTCCGACTGGAATGCCTTCCCCTGCTCCTCCTGCCTTGTCTGCCTGGCAAGTTCATCTCT 
CACGATCCCCTCAAAGGCCCCCTCCTCCAGGAAGGCAACCCCTGTGCCCCTCCCCTCCAGGCT
ACCTCTGCACTTTGTCAATGCTTCTCTTGTGGCACTTATCACACTGTATTTTACTTGTTTACA 
TGTTTGTCTCCCCTTCTAGACTGTGAATCCTTAAGGGCATGGACTGTATCTTATGCATCTCTG 
TATTTCTGCGCCTAGCACGGTGCCTAGCACACAGTAGGCGCTCAATAAATGTTGAATGAATGA 
ATGATTT

FIGURE 42

MQFHASVPSLMLFLPTGMPSPAPPALSAWQVHLSRSPQRPPPPGRQPLCPSPPGYLCTLSMLL 
LWHLSHCILLVYMFVSPSRL 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Microbodies C-terminal targeting signal. 

amino acids 81-83

FIGURE 43

GTTTCCAACAAGGATGATATGAAGACTTCCCTGAAGAAAGTTGTGAAGGGACCTCCTACGAGA 
TGATGATGCAGTGTGTGTCCCGCATGTTGGCCCACCCCCTGCATGTCATCTCAATGCGCTGCA
TGGTCCAGTTTGTGGGACGGGAGGCCAAGTACAGTGGTGTGCTGAGCTCCATTGGGAAGATTT 
TCAAAGAGGAAGGGCTGCTGGGATTCTTCGTTGGATTAATCCCTCACCTCCTGGGCGATGTGG 
TTTTCTTGTGGGGCTGTAACCTGCTGGCCCACTTCATCAATGCCTACCTGGTGGATGACAGCT 
TCAGCCAGGCCCTGGCCATCCGGAGCTATACCAAGTTCGTGATGGGGATTGCAGTGAGCATGC 
TGACCTACCCCTTCCTGCTAGTTGGCGACCTCATGGCTGTGAACAACTGCGGGCTGCAAGCTG 
GGCTCCCCCCTTACTCCCCAGTGTTCAAATCCTGGATTCACTGCTGGAAGTACCTGAGTGTGC 
AGGGCCAGCTCTTCCGAGGCTCCAGCCTGCTTTTCCGCCGGGTGTCATCAGGATCATGCTTTG 
CCCTGGAG TAA CCTGAATCATCTAAAAAACACGGTCTCAACCTGGCCACTGTGGGTGAGGCCT 
GACCACCTTGGGACACCTGCAAGACGACTCCAACCCAACAACAACCAGATGTGCTCCAGCCCA 
GCCGGGCTTCAGTTCCATATTTGCCATGTGTCTGTCCAGATGTGGGGTTGAGCGGGGGTGGGG 
CTGCACCCAGTGGATTGGGTCACCCGGCAGACCTAGGGAAGGTGAGGCGAGGTGGGGAGTTGG 
CAGAATCCCCATACCTCGCAGATTTGCTGAGTCTGTCTTGTGCAGAGGGCCAGAGAATGGCTT 
ATGGGGGCCCAGGTTGGATGGGGAAAGGCTAATGGGGTCAGACCCCACCCCGTCTACCCCTCC 
AGTCAGCCCAGCGCCCATCCTGCAGCTCAGCTGGGAGCATCATTCTCCTGCTTTGTACATAGG 
GTGTGGTCCCCTGGCACGTGGCCACCATCATGTCTAGGCCTATGCTAGGAGGCAAATGGCCAG
GCTCTGCCTGTGTTTTTCTCAACACTACTTTTCTGATATGAGGGCAGCACCTGCCTCTGAATG 
GGAAATCATGCAACTACTCAGAATGTGTCCTCCTCATCTAATGCTCATCTGTTTAATGGTGAT 
GCCTCGCGTACAGGATCTGGTTACCTGTGCAGTTGTGAATACCCAGAGGTTGGGCAGATCAGT 
GTCTCTAGTCCTACCCAGTTTTAAAGTTCATGGTAAGATTTGACCTCATCTCCCGCAAATAAA 
TGTATTGGTGATTTGGAAAAAAAAAAAAAAAAA

FIGURE 44

MMMQCVSRMLAHPLHVISMRCMVQFVGREAKYSGVLSSIGKIFKEEGLLGFFVGLIPHLLGDV 
VFLWGCNLLAHFINAYLVDDSFSQALAIRSYTKFVMGIAVSMLTYPFLLVGDLMAVNNCGLQA 
GLPPYSPVFKSWIHCWKYLSVQGQLFRGSSLLFRRVSSGSCFALE

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

Transmembrane domains: 

amino acids 51-72, 97-114 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 160-163 

N-myristoylation sites. 

amino acids 34-39, 100-105, 123-128, 165-170

FIGURE 45

GCTCACTCTTTGGGTCCACACTGCCTTTATGAGCTGTAACACTCACTGGGAATGTCTGCAGCT 
TCACTCCTGAAGCCAGCGAGACCACGAACCCACCAGGAGGAACAAACAACTCCAGACGCGCAG 
CCTTAAGAGCTGTAACACTCACCGCGAAGGTCTGCAGCTTCACTCCTGAGCCAGCCAGACCAC 
GAACCCACCAGAAGGAAGAAACTCCAAACACATCCGAACATCAGAAGGAGCAAACTCGTGACA 
CGCCACCTTTAAGAACCGTGACACTCAACGCTAGGGTCCGCGGCTTCATTCTTGAAGTCAGTG 
AGACCAAGAACCCACCAATTCCGGACACGGCAAAGTAACATCCTAGACATGGCTTTAGAGATC
CACATGTCAGACCCCATGTGCCTCATCGAGAACTTTAATGAGCAGCTGAAGGTTAATCAGGAA 
GCTTTGGAGATCCTGTCTGCCATTACGCAACCTGTAGTTGTGGTAGCGATTGTGGGCCTCTAT 
CGCACTGGCAAATCCTACCTGATGAACAAGCTGGCTGGGAAGAACAAGGGCTTCTCTGTTGCA 
TCTACGGTGCAGTCTCACACCAAGGGAATTTGGATATGGTGTGTGCCTCATCCCAACTGGCCA 
AATCACACATTAGTTCTGCTTGACACCGAGGGCCTGGGAGATGTAGAGAAGGCTGACAACAAG 
AATGATATCCAGATCTTTGCACTGGCACTCTTACTGAGCAGCACCTTTGTGTACAATACTGTG 
AACAAAATTGATCAGGGTGCTATCGACCTACTGCACAATGTGACAGAACTGACAGATCTGCTC 
AAGGCAAGAAACTCACCTGACCTTGACAGGGTTGAAGATCCTGCTGACTCTGCGAGCTTCTTC 
CCAGACTTAGTGTGGACTCTGAGAGATTTCTGCTTAGGCCTGGAAATAGATGGGCAACTTGTC 
ACACCAGATGAATACCTGGAGAATTCCCTAAGGCCAAAGCAAGGTAGTGATCAAAGAGTTCAA
AATTTCAATTTGCCCCGTCTGTGTATACAGAAGTTCTTTCCAAAAAAGAAATGCTTTATCTTT 
GACTTACCTGCTCACCAAAAAAAGCTTGCCCAACTTGAAACACTGCCTGATGATGAGCTAGAG 
CCTGAATTTGTGCAAGAAGTGACAGAATTCTGTTCCTACATCTTTAGCCATTCTATGACCAAG
ACTCTTCCAGGTGGCATCATGGTCAATGGATCTCGTCTAAAGAACCTGGTGCTGACCTATGTC 
AATGCCATCAGCAGTGGGGATCTGCCTTGCATAGAGAATGCAGTCCTGGCCTTGGCTCAGAGA 
GAGAACTCAGCTGCAGTGCAAAAGGCCATTGCCCACTATGACCAGCAAATGGGCCAGAAAGTG 
CAGCTGCCCATGGAAACCCTCCAGGAGCTGCTGGACCTGCACAGGACCAGTGAGAGGGAGGCC 
ATTGAAGTCTTCATGAAAAACTCTTTCAAGGATGTAGACCAAAGTTTCCAGAAAGAATTGGAG 
ACTCTACTAGATGCAAAACAGAATGACATTTGTAAACGGAACCTGGAAGCATCCTCGGATTAT 
TGCTCGGCTTTACTTAAGGATATTTTTGGTCCTCTAGAAGAAGCAGTGAAGCAGGGAATTTAT 
TCTAAGCCAGGAGGCCATAATCTCTTCATTCAGAAAACAGAAGAACTGAAGGCAAAGTACTAT 
CGGGAGCCTCGGAAAGGAATACAGGCTGAAGAAGTTCTGCAGAAATATTTAAAGTCCAAGGAG 
TCTGTGAGTCATGCAATATTACAGACTGACCAGGCTCTCACAGAGACGGAAAAAAAGAAGAAA 
GAGGCACAAGTGAAAGCAGAAGCTGAAAAGGCTGAAGCGCAAAGGTTGGCGGCGATTCAAAGG 
CAGAACGAGCAAATGATGCAGGAGAGGGAGAGACTCCATCAGGAACAAGTGAGACAAATGGAG 
ATAGCCAAACAAAATTGGCTGGCAGAGCAACAGAAAATGCAGGAACAACAGATGCAGGAACAG 
GCTGCACAGCTCAGCACAACATTCCAAGCTCAAAATAGAAGCCTTCTCAGTGAGCTCCAGCAC 
GCCCAGAGGGCTGTTAATAACGATGATCCATGTGTTTTACTCTAAAGTGCTAAATATGGGAGT 
TTCCTTTTTTTACTCTTTGTCACTGATGACACAACAGAAAAGAAACTGTAGACCTTGGGACAA 
TCAACATTTAAATAAACTTTATAATTATTAAA

FIGURE 46

MALEIHMSDPMCLIENFNEQLKVNQEALEILSAITQPVVVVAIVGLYRTGKSYLMNKLAGKNK 
GFSVASTVQSHTKGIWIWCVPHPNWPNHTLVLLDTEGLGDVEKADNKNDIQIFALALLLSSTF 
VYNTVNKIDQGAIDLLHNVTELTDLLKARNSPDLDRVEDPADSASFFPDLVWTLRDFCLGLEI 
DGQLVTPDEYLENSLRPKQGSDQRVQNFNLPRLCIQKFFPKKKCFIFDLPAHQKKLAQLETLP 
DDELEPEFVQQVTEFCSYIFSHSMTKTLPGGIMVNGSRLKNLVLTYVNAISSGDLPCIENAVL 
ALAQRENSAAVQKAIAHYDQQMGQKVQLPMETLQELLDLHRTSEREAIEVFMKNSFKDVDQSF 
QKELETLLDAKQNDICKRNLEASSDYCSALLKDIFGPLEEAVKQGIYSKPGGHNLFIQKTEEL 
KAKYYREPRKGIQAEEVLQKYLKSKESVSHAILQTDQALTETEKKKKEAQVKAEAEKAEAQRL 
AAIQRQNEQMMQERERLHQEQVRQMEIAKQNWLAEQQKMQEQQMQEQAAQLSTTFQAQNRSLL 
SELQHAQRAVNNDDPCVLL 

Important features of the protein: 
Transmembrane domains: 

amino acids 31-49, 114-131 

N-glycosylation sites. 

amino acids 90-94, 144-148, 287-291, 563-567 

N-myristoylation sites. 

amino acids 45-51, 283-289 

Prenyl group binding site. 

amino acids 583-588 

ATP/GTP-binding site motif A (P-loop).

amino acids 45-53

FIGURE 47

CACTCATTCATTCCAAAGGGTCTCTCAAGGCAATGGTAATGTGCAAGGAGGTGATACCTAAAT 
GAATGACCAAAAGAACATGCTTCTGCTTTTGTGTGTCTCCTACATTTTAGACATTTGTTTGTT 
TCTCTTGGTAGCCTTTAAATTCCTTGAAGCCCAGGACCATGTCTCACTTACCTTTGTGTTTCC 
ACTAACTAGTCTACCTCCTGGAATTGGCAGATACTCAGTGAAAGCCTGTGAAATAAGTGATGT 
CTATTTCTAGCATATTATTCTGAGATTTAATGATAGATTTAGTGATTGAATGAGATTTCCATT 
TTCAAATACAGCAAAAGCATAACTATTTTCATTCATTCATATTCATTCAACTTCATTCTCAAA 
ATTAGGTCCTGAGTTAACTAATAATTACCTTTGAAATGTGTGGGTTATTTGAGGCAATCAGGT 
GGTGACATTGAGCTCTCAGCCAGAGTTTGTTTCTGGAATTGATTCAGTTCCATTGCATTGATT 
TTTGTTCTCAGAAGCCAAGGTTTCCCATGAAAAATCATTCCCACTTGAATTGGGCTGTGATTC 
TTGCTGCGTTTAAGTAAAGGAAGCCTCTTGGTTCTAGTTCTGCAAACTTACACACTGAACTGG 
GACAAGTTTTTGTTTAGAGTAATGGCTGGGAAAAGAGGAACCTTTCATTTTATTCAGAAGTCA 
AAAACAAAGGCCTCCCAGCCACCTGGAGATGTTTTGTTGCAGACACCAGCCTGGCTCTGTCTT 
TATGCCTAACAATTGAGCATCCAGTCTTCTTTGTGCTGGGACCATTGCTCAGCTCTGCAAGGG 
GAAAAGAGGGAGAAAGCCAGAGCTGCCAGGCTTCTTGCACTGGGGCCGGGGGAGGGTTCCTGG 
GAAGCAGGTGCTCTCTGGCTTCTTGGTACGTGAGGCTCTCGGAGCTGCCTCTCCTCTGACCCT 
CAGGTCCTCACCGAGTTTGCTCCAGGAGTATATTGAAAACATACCCAGTGCTCTCTCAAGCAC 
CCACTGCTTAGAGGGCCCAGATTTCTTTTCCTTCTTTCCCTTGCAGAGCTGGAGACTGCATCG 
GGCATCTGGTGTTTAAACTAAACAGGAAAACTGACTAAAGGTCCACAGTGCTCATTGTGTAGA 
CTAGCTGCCCTCCG ATG GGTGCTCTGATTATCAGTGGTTCCAGTGCAGGGCCTGTCACTAAAC 
AGGCCTCACTTCCTCCTTGGGGGCTTTCCCATGGGAGGTGTGGCTTTTTACTCTACATGGAAA 
TGACTCTCTGCAGCCACAGAACACAGTCATTTTCTGAATTATCCCAGTCTCTCATGCGCCCTG 
GATTCCTCCAGATGCCTTATATCTCTTGTGCAAAGTTGTCTAAAATTTGGTTCCCAGCTTCCA 
AGCCTTGCCTTTTGGCCTTCCTGGAAGTATTTTTGTTGATGAGTCGTCTGTCATTATTCTCTA 
AAATGATTTGCTTTTTGTTTCTTTCATTCCTATTTCCACCCCACATATACACACATGCTTCTT 
AACTTAGGGGATTACATGCCAATAAATCTATTGTTGAAAATGCACTAATACTATCGCAAAGAC 
GAAAATTCACAGGCTGAACCGTTGTAAGTCCATATGCTCCTCAACTTACATGTGTGATGGAGT 
TATGCCCAAATAAGTCCATCGTCAAGTTGAAAAATCAAAATCAAGCCATCTTAGGTTGAGGAC 
CATTTGTTTGTACCTCCAAAGATGTCATATCTTTAAACATACTCCCTAGCTTTTCTTTTTACT 
TTTTATTTTGAAGTAATTATAGAATCACAGAAAGTTGCAAAAAA

FIGURE 48

MGALIISGSSAGPVTKQASLPPWGLSHGRCGFLLYMEMTLCSHRTQSFSELSQSLMRPGFLQM 
PYISCAKLSKIWFPASKPCLLAFLEVFLLMSRLSLFSKMICFLFLSFLFPPHIYTHAS 

Important features of the protein: 
Signal peptide: 

amino acids 1-41 

Transmembrane domain: 

amino acids 88-107 

Casein kinase II phosphorylation site. 

amino acids 47-50 

N-myristoylation site. 

amino acids 24-29

FIGURE 49

GGCTTCTACAGTCCACAACACCCACCAGCCCCAGGCCCAGCAGAATGAGCCCAGTGAGTGCCGGGGCTCCCAGTT 
TGGCTGTTGCTATGACAACGTGGCCACTGCAGCCGGTCCTCTTGGGGAAGGCTGTGTGGGCCAGCCCAGCCATGC 
CTACCCCGTGCGGTGCCTGCTGCCCAGTGCCCATGGGTCTTGTGCAGACTGGGCTGCCCGCTGGTACTTCGTTGC 
CTCTGTGGGCCAATGTAACCGCTTCTGGTATGGCGGCTGCCATGGCAATGCCAATAACTTTGCCTCGGAGCAAGA 
GTGCATGAGCAGCTGCCAGGGATCTCTCCATGGGCCCCGTCGTCCCCAGCCTGGGGCTTCTGGAAGGAGCACCCA 
CACGGATGGTGGCGGCAGCAGTCCTGCAGGCGAGCAGGAACCCAGCCAGCACAGGACAGGGGCCGCGGTGCAGAG 
AAAGCCCTGGCCTTCTGGTGGTCTCTGGCGGCAAGACCAACAGCCTGGGCCAGGGGAGGCCCCCCACACCCAGGC 
CTTTGGAGAATGGCCATGGGGGCAGGAGCTTGGGTCCAGGGCCCCTGGACTGGGTGGAGATGCCGGATCACCAGC 
GCCACCCTTCCACAGCTCCTCCTACAGATCTCACTTCCCACCTCTCCAGGATTAGCTTGGCAGGTGTGGAGCCCT 
CGTTGGTGCAGGCAGCCCTGGGGCAGTTGGTGCGGCTCTCCTGCTCAGACGACACTGCCCCGGAATCCCAGGCTG 
CCTGGCAGAAAGATGGCCAGCCCATCTCCTCTGACAGGCACAGGCTGCAGTTCGACGGATCCCTGATCATCCACC 
CCCTGCAGGCAGAGGACGCGGGCACCTACAGCTGTGGCAGCACCCGGCCAGGCCGCGACTCCCAGAAGATCCAAC 
TCCGCATTATAGGGGGTGACATGGCCGTGCTGTCTGAGGCTGAGCTGAGCCGCTTCCCTCAGCCCAGGGACCCAG 
CTCAGGACTTTGGCCAAGCGGGGGCTGCTGGGCCCCTGGGGGCCATCCCCTCTTCACACCCACAGCCTGCAAACA 
GGCTGCGTTTGGACCAGAACCAGCCCCGGGTGGTGGATGCCAGTCCAGGCCAGCGGATCCGGATGACCTGCCGTG 
CCGAAGGCTTCCCGCCCCCAGCCATCGAGTGGCAGAGAGATGGGCAGCCTGTCTCTTCTCCCAGACACCAGCTGC 
AGCCTGATGGCTCCCTGGTCATTAGCCGAGTGGCTGTAGAAGATGGCGGCTTCTACACCTGTGTCGCTTTCAATG 
GGCAGGACCGAGACCAGCGATGGGTCCAGCTCAGAGTTCTGGGGGAGCTGACAATCTCAGGACTGCCCCCTACTG 
TGACAGTGCCAGAGGGTGATACGGCCAGGCTATTGTGTGTGGTAGCAGGAGAAAGTGTGAACATCAGGTGGTCCA 
GGAACGGGCTACCTGTGCAGGCTGATGGCCACCGTGTCCACCAGTCCCCAGATGGCACGCTGCTCATTTACAACT 
TGCGGGCCAGGGATGAGGGCTCCTACATGTGCAGTGCCTACCAGGGGAGCCAGGCAGTCAGCCGCAGCACCGAGG 
TGAAGGTGGTCTCACCAGCACCCACCGCCCAGCCCAGGGACCCTGGCAGGGACTGCGTCGACCAGCCAGAGCTGG 
CCAACTGTGATTTGATCCTGCAGGCCCAGCTTTGTGGCAATGAGTATTACTCCAGCTTCTGCTGTGCCAGCTGTT 
CACGTTTCCAGCCTCACGCTCAGCCCATCTGGCAGTAGGGATGAAGGCTAGTTCCAGCCCCAGTCCAAAATAGTT 
CATAGGGCTAGGGAGAAAGGAAGATGGACTCTTGGCTTCCTCTCTCTGGCTGGCAAAGGGAGTTATCTTCTGGAA 
TACATTAGCTCTTTCAAAAACCCACCCAGTGTTTAGCCTCAACGGCAGCCAGTTACCAGCTTCTCTCTGTAGCCT 
TCAGCAGTGTTTGCATCTCTGACATAACCACAGGCTGCTGTTTTCAAGAAGAGCAATCTGTTTGGATAAGAAAAA 
CCTTTACTTTACAGCTTCCCTTTATAATTTGTTACACAGGAATAGTTAAATGCATTTGTTTGTTTGTTTTTTGAG 
ACGGAGTTTCACTCTTGTTGCCCAGGCTGGAGGGCAATGGCGCGATCTCAGCTCACTGCAACCTCCGTCTCCTGG 
GTTCTTGATTCTCCTGTGTCAGCCTTCTGAGTAGCTGGGATTACAGATGCCTATCACCATGCCTGGGTAATTTTT 
GTATTTTTAGTTGAGATGGGGTTTCGCCATGTTGGCCAGGCTGGTCTCGAACTTCTGACCTCAGATGATCTGCCC 
GCCTCAGCCTCCCAAAGTGCTGGGATTAGAGGCATGAGCCACCACGCCCAGCCATCAATGCATTTTTTTTATTTT 
TTTTTTGAGACAGAGTTTCGCACTTCTTGCCCAGGCTGGAGTACAATGGTGCGATCTTGGCTCACTGCAACCTCC 
ACCTCCTGGGTTCAAGCGCTTCTCCAGCCTCAGCCTCCTGAGTAGCTGGGATTACAGGTATGTGCCACCATGCCT 
GGCTAATTTTGTATTTTTGGTGGAGACGGGGTTTCTCCATGTTGGTCAGACTGGTCTTGAACTCCCGACCTCAGG 
TAATCCGCCCGCCTCCGCCTCCCAAAATGCTGGGATTAGAGGTGTGAGCCACTGTGCCCAGCCCATCAATGTGTT 
TTAAAGCTAGCTGTCAGGGTTCCACTTAATTTAAAGCTGGGCAGGGAGATGTGTAATGATTTCAAAGTTAACACC 
TGTTTGTTTTCTAAAGGGCATGCCAAGTCCTGCTGTATCAGGGAAGTATTCTGTGCTAAAATCAGCGATGGTTCA 
TTGCTCTAGTCTCTCTCACCCTTCTAGGCAGTGCATCAGTCAGCTCTAAATCTGGTGCAGAGGGTTAACAGCATA 
ACCCTTGTTGGCAAAATGGAATAGATGTTAAGACCTCAAATAGGGATTTGGGATGAAACAGCTGCAGTTAGCACT 
GTTATCTGAGCATGAAAGAACTGGAAACGCTCCTTACGTCGAGATGTTGGACCTTGAAGCCCTCCTGAGGCCAAC 
ATGCAAATCTGGCTGTGACGGTTCATCTGACACCTGTGTAAAGCTGACCAGCCTGCTCTGTACAGTGACAATGAG 
GAGCCCCTCTCTTCCTTAAGTAGGAATCTGTGAAGCAAAATGTTTGCTGCCAAAGACAAATCAGACTGTCAGTCA 
TTAAAAACAGCATTAGCAGGATGAGGATAGCAATGGGGAAGGGTTGTGGGCAATGCAGTAACAGGGAAATGGCTT 
CAGAAATGGTTTGAGTTGGAAGACAACATTCTTCATCTCTCAGGACTTCTAATTCCTTGATGCTAAAAGAAGAGG 
CATGGATTCTATGAGCTTCCAAGTCCCTTTCCACTTTAACCTTCTACAAATCTTTCAGAGGACTGCCTAGTAGCA 
AAGGTTATTCCTGGACACAGGAAAGACGGGCATTACAGGGACCAAAGCTCTGAAAGGTGACTTTTATTACCAACA 
CACTGGCTGGAAAAGGGACAAACCACATCACGGGTGAGTGATACTTCTCAGTCTTCTCTACTCATTCAACAAAGG 
AAATGTGGGCTGGGGCAGAGGTCTTTTTTCATTTAATACTGGAAAAATATTGAAGAGCATCCATGTTCACTTATG 
GCTGGTTTTGCTATAGAAATTGGAAAATAAAGGCCACTTTTTTG

WO 01/40466    PCT/US00/32678

FIGURE 50

MGPVVPSLGLLEGAPTRMVAAAVLQASRNPASTGQGPRCRESPGLLVVSGGKTNSLGQGRPPT
PRPLENGHGGRSLGPGPLDWVEMPDHQRHPSTAPPTDLTSHLSRISLAGVEPSLVQAALGQLV 
RLSCSDDTAPESQAAWQKDGQPISSDRHRLQFDGSLIIHPLQAEDAGTYSCGSTRPGRDSQKI 
QLRIIGGDMAVLSEAELSRFPQPRDPAQDFGQAGAAGPLGAIPSSHPQPANRLRLDQNQPRVV 
DASPGQRIRMTCRAEGFPPPAIEWQRDGQPVSSPRHQLQPDGSLVISRVAVEDGGFYTCVAFN 
GQDRDQRWVQLRVLGELTISGLPPTVTVPEGDTARLLCVVAGESVNIRWSRNGLPVQADGHRV
HQSPDGTLLIYNLRARDEGSYMCSAYQGSQAVSRSTEVKVVSPAPTAQPRDPGRDCVDQPELA
NCDLILQAQLCGNEYYSSFCCASCSRFQPHAQPIWQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

Tyrosine kinase phosphorylation site. 

amino acids 392-400 

N-myristoylation sites.

amino acids 9-15, 50-56, 112-118, 146-152, 173-179, 195-201,
220-226, 229-235, 280-286, 306-312, 336-342, 397-403



Myelin P0 protein.

amino acids 153-182

FIGURE 51

CAGGCAGAAGCGAACAAAGACCCAGCAAGAGAAGGCAGAGGCTAAGACCCATCCCGTATCTGC 
TCTCCTGAAATAATTCTGGAGTCATGCCTGAAATGCCAGAGGACATGGAGCAGGAGGAAGTTA 
ACATCCCTAATAGGAGGGTTCTGGTTACTGGTGCCACTGGGCTTCTTGGCAGAGCTGTACACA 
AAGAATTTCAGCAGAATAATTGGCATGCAGTTGGCTGTGGTTTCAGAAGAGCAAGACCAAAAT 
TTGAACAGGTTAATCTGTTGGATTCTAATGCAGTTCATCACATCATTCATGATTTTCAGCCCC 
ATGTTATAGTACATTGTGCAGCAGAGAGAAGACCAGATGTTGTAGAAAATCAGCCAGATGCTG 
CCTCTCAACTTAATGTGGATGCTTCTGGGAATTTAGCAAAGGAAGCAGCTGCTGTTGGAGCAT 
TTCTCATCTACATTAGCTCAGATTATGTATTTGATGGAACAAATCCACCTTACAGAGAGGAAG 
ACATACCAGCTCCCCTAAATTTGTATGGCAAAACAAAATTAGATGGAGAAAAGGCTGTCCTGG 
AGAACAATCTAGGAGCTGCTGTTTTGAGGATTCCTATTCTGTATGGGGAAGTTGAAAAGCTCG 
AAGAAAGTGCTGTGACTGTTATGTTTGATAAAGTGCAGTTCAGCAACAAGTCAGCAAACATGG 
ATCACTGGCAGCAGAGGTTCCCCACACATGTCAAAGATGTGGCCACTGTGTGCCGGCAGCTAG 
CAGAGAAGAGAATGCTGGATCCATCAATTAAGGGAACCTTTCACTGGTCTGGCAATGAACAGA 
TGACTAAGTATGAAATGGCATGTGCAATTGCAGATGCCTTCAACCTCCCCAGCAGTCACTTAA 
GACCTATTACTGACAGCCCTGTCCTAGGAGCACAACGTCCGAGAAATGCTCAGCTTGACTGCT 
CCAAATTGGAGACCTTGGGCATTGGCCAACGAACACCATTTCGAATTGGAATCAAAGAATCAC 
TTTGGCCTTTCCTCATTGACAAGAGATGGAGACAAACGGTCTTTCATTAGTTTATTTGTGTTG 
GGTTCTTTTTTTTTTTAAATGAAAAGTATAGTATGTGGCACTTTTTAAAGAACAAAGGAAATA 
GTTTTGTATGAGTACTTTAATTGTGACTCTTAGGATCTTTCAGGTAAATGATGCTCTTGCACT 
AGTGAAATTGTCTAAAGAAACTAAAGGGCAGTCATGCCCTGTTTGCAGTAATTTTTCTTTTTA 
TCATTTTGTTTGTCCTGGCTAAACTTGGAGTTTGAGTATAGTAAATTATGATCCTTAAATATT 
TGAGAGTCAGGATGAAGCAGATCTGCTGTAGACTTTTCAGATGAAATTGTTCATTCTCGTAAC 
CTCCATATTTTCAGGATTTTTGAAGCTGTTGACCTTTTCATGTTGATTATTTTAAATTGTGTG 
AAATAGTATAAAAATCATTGGTGTTCATTATTTGCTTTGCCTGAGCTCAGATCAAAATGTTTG 
AAGAAAGGAACTTTATTTTTGCAAGTTACGTACAGTTTTTATGCTTGAGATATTTCAACATGT 
TATGTATATTGGAACTTCTACAGCTTGATGCCTCCTGCTTTTATAGCAGTTTATGGGGAGCAC 
TTGAAAGAGCGTGTGTACATGTATTTTTTTTCTAGGCAAACATTGAATGCAAACGTGTATTTT 
TTTAATATAAATATATAACTGTCCTTTTCATCCCATGTTGCCGCTAAGTGATATTTCATATGT 
GTGGTTATACTCATAATAATGGGCCTTGTAAGTCTTTTCACCATTCATGAATAATAATAAATA
TGTACTGCTGGCATGTAATGCTTAGTTTTCTTGTATTTACTTCTTTTTTTAAATGTAAGGACC 
AAACTTCTAAACTAATTGTTCTTTTGTTGCTTTAATTTTTAAAAATTACATTCTTCTGATGTA 
ACATGTGATACATACAAAAGAATATAGTTTAATATGTATTGAAATAAAACACAATAAAATT

FIGURE 52

MPEMPEDMEQEEVNIPNRRVLVTGATGLLGRAVHKEFQQNNWHAVGCGFRRARPKFEQVNLLD 
SNAVHHIIHDFQPHVIVHCAAERRPDVVENQPDAASQLNVDASGNLAKEAAAVGAFLIYISSD
YVFDGTNPPYREEDIPAPLNLYGKTKLDGEKAVLENNLGAAVLRIPILYGEVEKLEESAVTVM 
FDKVQFSNKSANMDHWQQRFPTHVKDVATVCRQLAEKRMLDPSIKGTFHWSGNEQMTKYEMAC 
AIADAFNLPSSHLRPITDSPVLGAQRPRNAQLDCSKLETLGIGQRTPFRIGIKESLWPFLIDK 
RWRQTVFH 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 105-127 

N-glycosylation site. 

amino acids 197-201 

N-myristoylation site. 

amino acids 303-309 

Short-chain dehydrogenases/reductases family proteins. 

amino acids 18-30

FIGURE 53

TGGGCTCCCTCCAGCACTGCTGTTGCCTGCTGCCTAAGATGGGTGACACTTGGGCCCAGCTTCCCTGGCCTGGGC 

CACCCCACCCAGCAATGCTGCTGATCTCCCTCCTCTTGGCAGCCGGGTTGATGCACTCGGATGCCGGCACCAGCT 

GCCCCGTCCTTTGCACATGCCGTAACCAGGTGGTGGATTGTAGCAGCCAGCGGCTATTCTCCGTGCCCCCAGACC 

TGCCAATGGACACCCGAAACCTCAGCCTGGCCCACAACCGCATCACAGCAGTGCCGCCTGGCTACCTCACATGCT 

ACATGGAGCTCCAGGTGCTGGATTTGCACAACAACTCCTTAATGGAGCTGCCCCGGGGCCTCTTCCTCCATGCCA 

AGCGCTTGGCACACTTGGACCTGAGCTACAACAATTTCAGCCATGTGCCAGCCGACATGTTCCAGGAGGCCCATG 

GGCTAGTCCACATCGACCTGAGCCACAACCCCTGGCTGCGGAGGGTGCATCCCCAGGCCTTTCAGGGCCTCATGC 

AGCTCCGAGACCTGGACCTCAGTTATGGGGGCCTGGCCTTCCTCAGCCTGGAGGCTCTTGAGGGCCTACCGGGGC 

TGGTGACCCTGCAGATCGGTGGCAATCCCTGGGTGTGTGGCTGCACCATGGAACCCCTGCTGAAGTGGCTGCGAA 

ACCGGATCCAGCGCTGTACAGCAGATTCTCAGCTGGCTGAGTGCCGGGGCCCTCCTGAAGTCGAGGGCGCCCCGC 

TCTTCTCACTCACTGAGGAGAGCTTCAAGGCCTGCCACCTGACCCTGACCCTGGATGATTACCTATTCATTGCGT 

TCGTGGGCTTCGTGGTCTCCATTGCTTCTGTGGCGACCAACTTCCTCCTGGGCATCACTGCCAACTGCTGCCACC 

GCTGGAGCAAGGCCAGTGAAGAGGAAGAGATCTGACATGCCTGCCTCTCATCCCTCCATGCTGCTGACCGCCACA 

GCTGCTGGCCACCAGACGCCCTCCCTGATTGCTCACTCTGGTTCCATGGTGACCTGGCTGCCTCAGTCATGGTTC 

AAGCAAGGTGGGGACACTCATTTTGTATGAGCATCTGCTTTGGGCCAGGCGGCACGCTAGGAATTGGGAACATCA 

GATGAACTGACTCAGTCCCTGCCCTCAAGGCACTTCCCTCTGGTCAAGGAGAGAGATCCAAAAACTATTCCCTTT

AAGACTATATGTCAGGACTCTGAGCACGTCATTATGGAGGCCCAGAGGAGGAGCCATCATCTGTATCTAGCAATG 

TCCATGAGAATTATAAGATTAGAGTGATTTGTGAACTGGGTCATCAGGAAATATCTACTTTGTCAGGTAGGCAAA 

GAAGGGTGTCTGCACATGGCAGAGGCCAGAATATGCATAGTGTGCTGTGTTGAGAAGAGTGAACAGTTCCTGGTC 

ACTTACTTGTATAGAGGGGGTGTGGCACAGAACTCAAACCTACCCCTCACCTCCTGACACCAAAACTGTCAGCTC 

TCAGCAATGCCAGCCACTGCCTACAGGGAGTAAGAACACCTCTATGACAGCCCCTGGCCTCCTTCCACCAGCAGC 

TACCAGGTGAGACCACCTCCCAGTGACTGCCCCCATATGACCAAATGTCACCAGTTGGTGAGGTCCCAGGCAGCA 

GGCTGAGGATGGACACTTTCAATGCCCTTGCTCCTGCCTCTCACTCAAGTTTTGCTTCAGAAGAGAGAGGCAGGA 

GGCCCAGCAACTGGGGCAGCAAGAGTCCTGGCACCTTGGGATCCTAATCATGTGACTGTTCTTGCCACAGTGCTC 

ATGCCACAGGGTCTCACCAGGAAAGTGCACTGTGGGCCACAGACCCACAGCCTGGCAGCACCCAGAGCTAAAAGG 

GGACAAAGGCAGCACAGTTATGACCATATGAGGCTTTGCATTTTCTTCTAAGCAACTTACCCACGTTAAGCATGA 

GGGTGAGAGAGCTATTAAATACTAAGCCCTTGCCAGTGTCAGGTACTTTGAAAAGCTCTCTGCACAAACCATTCC 

CTTTGACACACACACACACAAATCTTTTGAGGTGAACGCTGTTGTTCCCATTTTACGGATGAGGCAACTAAGGCT 

CAGAGAGGTTAAAGTCACATGCCACTATGAGCAAGATAAAGTCTGTGCTCTTTCTACTGCCCCATCCAAGTTGGG 

GAACATCACCATTCCCTCTAGAGTTATATAAATTCAAATTCAACTAGAGCTGACAAAGTTCCTCATAAGGTCCAG 

GCACTCCTCTGGGCACTTTTATATCTATTGACTCACTTCTTTCAATTCTCACAGCAACACTGCCTGGTGGTTTTT 

ATTATCCCCATTTGACAGATGAATTAATCGTAGAGAGTTGAGTGACTTACCCAAGGTTGTCTGGATAAGCCCTAG 

AAGGAAGGGGGTAGGCAGCTCCATTCAGGGAAACTGCATCTAATCAGTCAGTCAAAAATCAAGTAACTTTACGAG

CAAAGCACAATTATCATCATCGTGGTCTTCTTCATCAGTTTCGTCAGCAGCATCATTATCTTCCCTCTATTTGTT 

CAGCACCGGATAGTTCATGAGTATTTTTGCATCATTCTCCTTGACTTTTCACATCCCTGTGCAGGAGGTAAATCA 

AACATCAGTAATCCTGTTTTACAGATGGGGAAAAAAGTCTCAAGGTTGGATATGACTTGCTATGTGGCAAGGTTG 

GGGCTCAACCCTAACACAGTTCTCTTTCCAGTGCTTTCTCAAGTGCTTGGGGAAGAGAATGCCTCAGAAGGCTGG 

GTAGTGGGGCCCTGGAATTCAGCATCCATGAATGTGCTAGTGGATAAGCTAAATAGAAGGCAGCCAAACCCATCT 

GCTGTACAGATTGAACTATGCTCACGGTAGGGCAAATTGCAGGCTCTGAAACAGAGACTACACAGGTAACACCTG 

AATAGGAGACTCCTGCTTTACAATGTGTAGATAAAACATCAGCAATGGTGGCCATGGTGGCAGTCATGTGAAAAG 

TAAGATCTTTGGGAATCAAGAAAGGAAGCTGTGTTAACCACTCCTGCTCAAGCCCTGCTGCGTGTGTTGCAAGAG 

ATACTAAGAGAGCAAGAAAGCTATAGGTGAGAACCTCTGCAGTTTAGGAGAAGAACATCAAGGCACAGTCCAACA 

TGCTGATAAGTCTGGCCAGGAGGAGAATTAAAACAGGGGCTTTCCACACCTCCCTTGCCCCAAGCTCCAGCGGTA 

TTCTATCAGCCCATCCTCCTGGAAAGCCTGAAAGGAATGAAGGAGGCTAATAAGTCATCTTCCAGGAAGGCATCC 

CTCACTCGTGCTTCCCTGAGCTAGTCAACCAAAAGAGTCTTCAGTAACTTTGCTAGACCTGAAGTACTTGAACCT

GTGTCCCCTGAATCTTTCTTACAACATCTGGGACAAATCCCTGGTCCTGTGACATCCGAAGCAGAACTGTGCCCT 

GCTCTCTCCTTCTGTGATGACCAAGGATGGTGAACTCAAGTTGTTCTCTACAAGCCAGGCCAGCAACCTAAATAC 

TTGGAGAGGAACTTTTAGAAACTATAATCCTGACAAAATAGAAAAGTTTCCCATAGGGGCATACCATAATACTAT 

AATAACCTCCCAGGAACTATTGTTTGCCAAAATGTAGTTAATATATTTTAAGATATATGCTTTTTTGCATAGGAC 

TAGAACCAGAAAAGACACCAAATGCCCCCTTGACATCAATGTCCTTTCTAGTGGGACAATTTGGTCTCCATTAAT 

GCCAAACCTTTCTGAACAGGATACATGGCTTTTAAAGGACAGATGTTTCTCCTGCTGCTAGAAGTTCCTCAGTTT 

ACTAGAGCACAATGAGGAAAGTATTCAACCTCCCTACTGCCAAGGAATTCCCTGCTTCTCCCCCACCGCCATCAT 

CTTGTCCAAGCTATCAGAAGCAACCTTCTAGAGATAATCTAACAATCCTGATTAGAATTGCTCCCATATCCCTGG 

TGACCACAGGCTTCATTCAAATTGTCCAAACTGGTTAACATGTATGTGATGGGGTATCTCTGCATCTGTATGTCT 

GTCTGCGAGGTTCCTTGTATATTGGCTGTCCGCTGACTTGGGACAGATCTCTCTAGAACTTGGGTTCAGTTCTCT 

GACATAGTCCACTCAGCCATAGGCTGAGTGGCTAAATATGCATAAATAAGCATGCCTAAATAGGCATATATAGGT 

TGGTGCAAAAGTAATTGCGGTTTTTGCCATTAAAAATGATGGCAAAAATCCCAATTACTTTTGCGTCAATCTAAT 

ATTACATTGCTTGATAGATTAAGATGGAATCCCACCAGGTTTAGGGTAGGACTGGATGCTCAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAA

FIGURE 54

MLLISLLLAAGLMHSDAGTSCPVLCTCRNQVVDCSSQRLFSVPPDLPMDTRNLSLAHNRITAV
PPGYLTCYMELQVLDLHNNSLMELPRGLFLHAKRLAHLDLSYNNFSHVPADMFQEAHGLVHID 
LSHNPWLRRVHPQAFQGLMQLRDLDLSYGGLAFLSLEALEGLPGLVTLQIGGNPWVCGCTMEP 
LLKWLRNRIQRCTADSQLAECRGPPEVEGAPLFSLTEESFKACHLTLTLDDYLFIAFVGFVVS 
IASVATNFLLGITANCCHRWSKASEEEEI 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 241-260 

N-glycosylation sites.

amino acids 52-55, 81-84, 107-110 

Tyrosine kinase phosphorylation site. 

amino acids 148-154 

N-myristoylation sites.

amino acids 11-15, 263-268 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 175-185 

Leucine zipper pattern. 

amino acids 77-98

FIGURE 55

GGCTGCGCCCAGGCCGGCGGGCCCAGCAGCTGCGAACCGCCGGCGCACCACCTGTTTCCGCGC
CCGGGGACTTCCCCGGCGGGGCTCAGAAGTGTGGGGTCGGTCGCTTGGCTTCCCCTGGCGTCA 
GCGACCCAGGGTAACCTCCTCCACTGCTGCGTGCCGTGCAGGCCTGCCTGTGTGAGAGCCACG 
TGTGCCGCGCTCTGGGCACAGCCTTGGAAAGTCAGGACCGCGACGGCAGCAGAGCAGAAACCT 
TACAGAAACATGAAGCCCTCAACCATCTGCTACTCAGTTATTCGGGGCTGACGGCGGCTTCTA 
GAACATCCAGGTGTTCTGCAGATGCGAGAACTCATCCTGTAGTCACCAGATGGAGTCCCAAAC 
AGCCAAGCAGATGTAAGGCCTGTGCTGTGGCTCTGAGGCCCTGAATACAGAAGGGTCACTTTC 
TTAGTGGCCAAAGAGCAGTTGTTGACATTGATGTCTAATTATTGAACACGACCAGTCATTTTA 
CTGAGCTGCAGTGAGGAAACACTGACCATAGAAGATCAAGCCAAATGAGGGATTGCAAATTTC 
CTGATTCTTTTGAATTAGGATTCCAGATGGGGGCCTCATTTCTACAGCCCCCAACATTCCTAT 
AGCCGTTATCACTGCCATCACCACTGCCACCAGCATCTTCTTGCAGATTCCACCCCTGCTCCC 
CAGAGACTTCCTGCTTTGAAAGTGAGCAGAAAGGAAGCTCTCAGAAAAATCTCTAGTGGTGGC 
TGCCGTCGCTCCAGACAATCGGAATCCTGCCTTCACCACCATGGGCTGGCTTTTTCTAAAGGT 
TTTGTTGGCGGGAGTGAGTTTCTCAGGATTTCTTTATCCTCTTGTGGATTTTTGCATCAGTGG 
GAAAACAAGAGGACAGAAGCCAAACTTTGTGATTATTTTGGCCGATGACATGGGGTGGGGTGA 
CCTGGGAGCAAACTGGGCAGAAACAAAGGACACTGCCAACCTTGATAAGATGGCTTCGGAGGG 
AATGAGGTTTGTGGATTTCCATGCAGCTGCCTCCACCTGCTCACCCTCCCGGGCTTCCTTGCT 
CACCGGCCGGCTTGGCCTTCGCAATGGAGTCACACGCAACTTTGCAGTCACTTCTGTGGGAGG 
CCTTCCGCTCAACGAGACCACCTTGGCAGAGGTGCTGCAGCAGGCGGGTTACGTCACTGGGAT 
AATAGGCAAATGGCATCTTGGACACCACGGCTCTTATCACCCCAACTTCCGTGGTTTTGATTA 
CTACTTTGGAATCCCATATAGCCATGATATGGGCTGTACTGATACTCCAGGCTACAACCACCC 
TCCTTGTCCAGCGTGTCCACAGGGTGATGGACCATCAAGGAACCTTCAAAGAGACTGTTACAC 
TGACGTGGCCCTCCCTCTTTATGAAAACCTCAACATTGTGGAGCAGCCGGTGAACTTGAGCAG
CCTTGCCCAGAAGTATGCTGAGAAAGCAACCCAGTTCATCCAGCGTGCAAGCACCAGCGGGAG 
GCCCTTCCTGCTCTATGTGGCTCTGGCCCACATGCACGTGCCCTTACCTGTGACTCAGCTACC 
AGCAGCGCCACGGGGCAGAAGCCTGTATGGTGCAGGGCTCTGGGAGATGGACAGTCTGGTGGG 
CCAGATCAAGGACAAAGTTGACCACACAGTGAAGGAAAACACATTCCTCTGGTTTACAGGAGA 
CAATGGCCCGTGGGCTCAGAAGTGTGAGCTAGCGGGCAGTGTGGGTCCCTTCACTGGATTTTG 
GCAAACTCGTCAAGGGGGAAGTCCAGCCAAGCAGACGACCTGGGAAGGAGGGCACCGGGTCCC 
AGCACTGGCTTACTGGCCTGGCAGAGTTCCAGTTAATGTCACCAGCACTGCCTTGTTAAGCGT 
GCTGGACATTTTTCCAACTGTGGTAGCCCTGGCCCAGGCCAGCTTACCTCAAGGACGGCGCTT 
TGATGGTGTGGACGTCTCCGAGGTGCTCTTTGGCCGGTCACAGCCTGGGCACAGGGTGCTGTT 
CCACCCCAACAGCGGGGCAGCTGGAGAGTTTGGAGCCCTGCAGACTGTCCGCCTGGAGCGTTA 
CAAGGCCTTCTACATTACCGGTGGAGCCAGGGCGTGTGATGGGAGCATGGTGCCTGAGCTGCA
GCATAAGTTTCCTCTGATTTTCAACCTGGAAGACGATACCGCAGAAGCTGTGCCCCTAGAAAG 
AGGTGGTGCGGAGTACCAGGCTGTGCTGCCCGAGGTCAGAAAGGTTCTTGCAGACGTCCTCCA 
AGACATTGCCAACGACAACATCTCCAGCGCAGATTACACTCAGGACCCTTCAGTAACTCCCTG 
CTGTAATCCCTACCAAATTGCCTGCCGCTGTCAAGCCGCATAACAGACCAATTTTTATTCCAC
GAGGAGGAGTACCTGGAAATTAGGCAAGTTTGCTTCCAAATTTCATTTTTACCCTCTTTACAA 
ACACACGCTTTAGTTTAGTCTTGGAGTTTAGTTTTGGAGTTAGCCTTGCATATCCCTTCTGTA 
TCCTGTCCCCCCTCCACGCCGACCCGAGAGCAGCTGAGCTGCGCTGGCTCTGGGCAGGGAGTG 
TGCCTTAATGGGAAGCACACGGGCTTTGGAGTCAGGCACAGGTGCCAGCTCCAGCTTTTGAAC
TTGGGCAATTGTTTAACCTAACCTGCAAGTTGATTTTGAGGGTTAAATAAAGGCATACATGAA 
AATGCCTGGCAACTTTAAAAAAAAAAAA

FIGURE 56

MGWLFLKVLLAGVSFSGFLYPLVDFCISGKTRGQKPNFVIILADDMGWGDLGANWAETKDTAN 
LDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGVTRNFAVTSVGGLPLNETTLAEVLQ 
QAGYVTGIIGKWHLGHHGSYHPNFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPACPQGDGPSR 
NLQRDCYTDVALPLYENLNIVEQPVNLSSLAQKYAEKATQFIQRASTSGRPFLLYVALAHMHV 
PLPVTQLPAAPRGRSLYGAGLWEMDSLVGQIKDKVDHTVKENTFLWFTGDNGPWAQKCELAGS 
VGPFTGFWQTRQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAQA 
SLPQGRRFDGVDVSEVLFGRSQPGHRVLFHPNSGAAGEFGALQTVRLERYKAFYITGGARACD 
GSMVPELQHKFPLIFNLEDDTAEAVPLERGGAEYQAVLPEVRKVLADVLQDIANDNISSADYT 
QDPSVTPCCNPYQIACRCQAA 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain:

amino acids 353-373 

N-glycosylation sites.

amino acids 117-120, 215-218, 356-359, 497-500

N-myristoylation sites.

amino acids 12-17, 33-38, 52-57, 97-102, 101-106, 113-118, 158-163,
328-333, 388-393, 418-423, 435-440, 436-441

Amidation site. 

amino acids 382-385 

Sulfatases signature 2.

amino acids 129-138

FIGURE 57

TGGACAAGACACCTCCAGGAGCCCAGCTCACAGCCACCGGTACCTTCTTCCAGGACAAGCTGG 
GGGCCTCCATGGGCGCCTGAGGGCCAGGCGCCAGGGCCGTGGGCACGAGTATGGTGAGACACC 
AGCCCCTGCAGTACTACGAGCCACAGCTGTGCCTCTCCTGCCTCACGGGCATCTACGGCTGCC 
GTTGGAAGCGCTACCAGCGCTCCCATGATGATACCACACCGGGCACAGCGCCATTCCTGCATG 
TGGGGGCTGTGGCAGCAGTCACCATGCTCTCCTGGATCGTGGCAGGACAGTTCGCCCGTGCAG 
AGCGGACCTCCTCCCAGGTGACCATTCTCTGTACCTTCTTCACCGTGGTGTTTGCCCTCTACC 
TGGCCCCTCTCACCATCTCCTCTCCCTGCATCATGGAGAAGAAAGACCTCGGCGCCAAGCCTG 
CTCTCATTGGCCACCGCGGGGCCCCCATGCTGGCTCCAGAGCACACGCTCATGTCCTTCCGGA 
AGGCCCTCGAGCAGAAGCTGTACGGGCTCCAGGCTGACATTACCATCAGCCTGGACGGCGTGC 
CCTTCCTCATGCATGACACCACCCTGCGGCGCACCACCAACGTGGAGGAGGAGTTCCCGGAGC 
TGGCCCGCAGGCCTGCCTCCATGCTTAACTGGACCACCCTGCAGAGACTCAACGCTGGCCAGT 
GGTTCCTGAAGACTGACCCCTTCTGGACAGCCAGCTCCCTGTCACCCTCCGACCACAGAGAGG 
CCCAGAACCAGTCCATCTGCAGCCTGGCAGAGCTCCTGGAGCTGGCCAAGGGCAATGCCACAC 
TGCTGCTCAACCTGCGTGACCCGCCCCGGGAGCACCCCTACCGCAGCAGTTTTATCAACGTGA 
CTCTGGAGGCCGTGCTGCACTCCGGCTTCCCCCAGCACCAGGTCATGTGGCTGCCTAGCAGGC 
AGAGGCCCCTGGTGCGGAAGGTGGCTCCCGGCTTCCAACAGACATCAGGCTCCAAGGAGGCAG 
TCGCCAGCCTGCGGAGAGGCCACATCCAGCGGCTGAACCTGCGCTACACTCAGGTGTCCCGCC 
AGGAGCTCAGGGACTACGCGTCCTGGAACCTGAGTGTGAACCTCTACACAGTCAACGCACCGT 
GGCTCTTCTCCCTGCTGTGGTGTGCGGGGGTCCCATCCGTCACCTCTGACAACTCCCACACCC 
TGTCCCAGGTGCCTTCCCCCCTCTGGATCATGCCCCCGGACGAGTACTGTCTCATGTGGGTCA 
CTGCCGACCTGGTCTCCTTCACCCTCATCGTGGGCATCTTCGTGCTCCAGAAGTGGCGCCTGG 
GTGGCATACGGAGCTACAACCCTGAGCAGATCATGCTGAGTGCTGCGGTGCGCCGGACCAGCC 
GGGACGTCAGCATCATGAAGGAGAAGCTTATTTTCTCAGAGATCAGCGATGGTGTAGAGGTCT 
CCGATGTGCTCTCCGTATGTTCAGACAACAGTTATGACACATATGCCAACAGCACCGCCACCC 
CTGTGGGCCCCCGAGGGGGTGGCAGCCACACCAAGACCCTCATAGAGCGGAGTGGGCGTTAGC 
TGAAGACATGTCTGTCCCACCTGTACCTGACACAGAAGCTGGGGAGCCTAGGAGAGCTGGTGG 
AAGTGTGTCTGAACTCGGAGTGCTCTGGGAGCGGGCTCCACAGCCTCCTTGTGGGCTCCAGCC
CCTTGTCAGCCGCAGCCTCTCTTGAGGGGGACTCCCTGTCTCCTGAGGCCCAGCTGGGCCAGG 
ACTCCATCCTTTCAGATGCCCCTGCAGGCCTGGGGCTCCTTCTGGGAAGTATGGGGCCTAGGG 
CTTGGTCCCCCTCTTCTGAGGCCCTCTCCTGTATCCCGACCTGGAAGCTTTGATGGGTCATGG 
GCCATGCCATACCCCCTGTGGCAATGGAGTGTGTGGATGCTCACCTGTGCCATCTGTCCTCCT 
GTCTGTGCCAGGAGGCACCTGAGTTCTCTGCTGTTATCCTGCCCCAAGGGCCTGGGCCGAGCC
TCTACCTGAAGCAACTCTGCTCTTCCTGTCAGTCTCAAAGCACAAGGAGGTTCAGCCCAGGAG 
GAAGCCAGCTGCAATGTGGAGACACGTCCTCCTCCCCAACCCACCTCATGCCACCGCCAACCC 
CCTGCCCCAGGAGCGGGCCTGAGCCACGTCCCCTAGGAGCAGCTGGAGATGGCCAAAAGAGTG 
AGCTCAGGACTACTGGATCCCATGCCCAGGTGTCCAGCAGACCTCAAGGCAGAAGGGTCACCT 
AACCCAGGAGTCCACAGACTGATGTGACCTCAGGTTCCCACATCAGTGGCCACAGGGCAGGGC 
CCACCTGGTAGAAGTGTTCTGGATATGGCCAGGGTGGGTGTGTGGCTAAGTGGGCCTGAACAG
AGGGAACCTAGGGCCCTTGGCCAATGTGATTAAAGCTGCCATCTTGAAA

FIGURE 58

MVRHQPLQYYEPQLCLSCLTGIYGCRWKRYQRSHDDTTPGTAPFLHVGAVAAVTMLSWIVAGQ 
FARAERTSSQVTILCTFFTVVFALYLAPLTISSPCIMEKKDLGPKPALIGHRGAPMLAPEHTL
MSFRKALEQKLYGLQADITISLDGVPFLMHDTTLRRTTNVEEEFPELARRPASMLNWTTLQRL 
NAGQWFLKTDPFWTASSLSPSDHREAQNQSICSLAELLELAKGNATLLLNLRDPPREHPYRSS 
FINVTLEAVLHSGFPQHQVMWLPSRQRPLVRKVAPGFQQTSGSKEAVASLRRGHIQRLNLRYT 
QVSRQELRDYASWNLSVNLYTVNAPWLFSLLWCAGVPSVTSDNSHTLSQVPSPLWIMPPDEYC 
LMWVTADLVSFTLIVGIFVLQKWRLGGIRSYNPEQIMLSAAVRRTSRDVSIMKEKLIFSEISD 
GVEVSDVLSVCSDNSYDTYANSTATPVGPRGGGSHTKTLIERSGR 

Important features of the protein: 
Signal peptide:

amino acids 1-24 

Transmembrane domains:

amino acids 47-61, 77-93, 335-350, 380-399

N-glycosylation sites.

amino acids 182-186, 217-221, 233-237, 255-259, 329-333, 462-466

Tyrosine kinase phosphorylation site. 

amino acids 130-139 

N-myristoylation sites. 

amino acids 21-27, 48-54, 294-300, 404-410, 442-448, 473-479

FIGURE 59

CCTGAGCAAACACAGCAGCCCGAGTGTTCCCAAGGCCAAAATGCTGAGAACGTCCACTCCTAA 
TCTGTGTGGTGGTCTGCATTGCCGGGCCCCCTGGCTCTCTTCTGGCATTCTCTGCCTCTGCCT 
CATATTCTTGTTAGGCCAGGTGGGCTTGCTGCAGGGACACCCCCAGTGCCTGGATTACGGGCC 
CCCTTTCCAGCCCCCTCTGCACCTTGAGTTTTGCTCTGACTATGAGTCCTTCGGCTGCTGTGA 
TCAGCACAAGGACCGCCGCATCGCTGCCCGGTACTGGGACATCATGGAATATTTTGATCTGAA 
GAGACATGAGCTGTGTGGAGATTACATTAAAGACATCCTTTGCCAGGAGTGCTCGCCCTACGC 
AGCCCACCTCTACGACGCCGAAAACACCCAGACGCCTCTCCGGAATCTCCCGGGCCTCTGCTC 
TGATTACTGCTCTGCCTTCCATTCTAACTGTCACTCAGCCATTTCCCTGCTGACCAATGACCG 
CGGCCTCCAGGAGTCTCATGGAAGGGACGGTACCCGCTTCTGCCACCTCCTGGACCTTCCTGA 
CAAGGACTATTGCTTCCCTAATGTCCTGAGGAACGACTATCTCAACCGCCACCTGGGCATGGT 
GGCCCAAGATCCTCAGGGCTGCCTGCAGCTCTGCCTGAGCGAGGTGGCCAACGGGCTGAGGAA 
CCCCGTCTCCATGGTCCATGCTGGGGACGGCACCCATCGCTTCTTTGTTGCCGAGCAGGTAGG 
AGTGGTGTGGGTCTACCTCCCTGATGGGAGTCGCCTGGAGCAACCCTTCCTGGACCTCAAGAA 
CATCGTGTTGACCACCCCATGGATCGGGGATGAGAGAGGCTTCTTGGGGTTGGCTTTTCACCC 
CAAATTCCGCCACAATCGCAAGTTCTATATTTATTATTCGTGCCTGGACAAGAAGAAGGTAGA 
AAAGATCCGAATTAGTGAGATGAAGGTTTCTCGGGCTGATCCTAACAAAGCTGACCTGAAATC 
AGAGAGGGTCATCTTGGAGATTGAAGAACCAGCCTCAAACCATAATGGCGGACAACTTCTTTT 
TGGCCTGGATGGCTATATGTACATATTCACTGGGGACGGGGGACAGGCTGGAGATCCCTTTGG 
CCTGTTTGGAAATGCTCAGAACAAAAGTTCCCTGCTGGGAAAAGTTTTAAGGATCGATGTGAA 
CAGGGCAGGCTCACATGGCAAGCGGTACCGAGTCCCCTCGGACAATCCATTTGTTTCTGAGCC 
AGGGGCCCACCCCGCCATCTATGCCTATGGGATCAGGAACATGTGGCGTTGTGCTGTGGACCG 
AGGGGACCCCATCACGCGCCAGGGCCGAGGCCGGATATTCTGTGGGGACGTGGGCCAGAACAG 
GTTTGAAGAGGTTGACCTCATTTTGAAAGGTGGAAACTATGGCTGGAGAGCAAAGGAAGGGTT 
TGCATGTTATGACAAAAAACTTTGTCACAATGCCTCTTTGGATGATGTTCTGCCAATCTATGC 
TTATGGCCATGCAGTGGGGAAGTCAGTCACTGGAGGTTATGTCTATCGTGGTTGTGAATCCCC
AAATCTCAATGGCCTGTATATCTTTGGAGACTTCATGAGTGGTCGACTTATGGCTTTGCAGGA 
AGATAGAAAAAACAAGAAATGGAAGAAGCAGGATCTTTGCCTGGGCAGCACCACGTCCTGTGC 
CTTCCCAGGGCTGATCAGCACCCATAGCAAGTTCATCATCTCCTTTGCTGAAGATGAAGCAGG 
GGAGCTGTATTTCCTGGCGACCTCTTACCCAAGTGCCTATGCACCACGTGGATCTATTTACAA 
GTTTGTTGACCCCTCAAGGCGAGCACCCCCAGGCAAGTGCAAATACAAGCCAGTGCCCGTGAG 
AACCAAGAGTAAGCGGATCCCGTTCAGACCACTCGCCAAGACAGTCTTGGACTTGCTAAAGGA 
ACAATCAGAGAAAGCTGCTAGAAAATCTTCCAGTGCAACCTTAGCTTCTGGCCCAGCCCAGGG 
TTTGTCTGAGAAAGGCTCCTCCAAGAAGCTGGCTTCTCCTACAAGCAGCAAGAATACATTGCG 
AGGGCCTGGTACAAAGAAGAAAGCCAGAGTGGGGCCCCACGTCCGCCAGGGCAAGAGGAGGAA 
GAGCCTGAAAAGCCACAGTGGCAGGATGAGGCCATCAGCAGAGCAGAAGCGAGCTGGCAGAAG 
TCTCCCTTGACCTATTGGTCAAGGTGGCCGACAGGGTGACGTGAGAGAGGAGAGCCACCTCAT 
CAAATGAAAGTCACTGCTGAATAAAGACCTTAGAAGTCTGGGAAGCCAGGGTAGAGGTGGGGC 
AGGGCGGTTTTCCTCTCCCTGGGAAATCTTGCTGTCTACTGAATAAATAAATGCACCTTCTCT 
GTATGCAGTGCTTCTGTGGGAGACCATATCCCAGATTGCTGGTGCACCTGGGTTATGGTAAGC 
ACTAGTCCATGAGCCTGCTTGGAATCACACTGGATGTCTCCGTTTTGTCTTGTAAATGCCTAC 
AACCTGAGGTAATAAATCAACATTTGCTCA 
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FIGURE 60 

MLRTSTPNLCGGLHCRAPWLSSGILCLCLIFLLGQVGLLQGHPQCLDYGPPFQPPLHLEFCSD 
YESFGCCDQHKDRRIAARYWDIMEYFDLKRHELCGDYIKDILCQECSPYAAHLYDAENTQTPL 
RNLPGLCSDYCSAFHSNCHSAISLLTNDRGLQESHGRDGTRFCHLLDLPDKDYCFPNVLRNDY 
LNRHLGMVAQDPQGCLQLCLSEVANGLRNPVSMVHAGDGTHRFFVAEQVGVVWVYLPDGSRLE 
QPFLDLKNIVLTTPWIGDERGFLGLAFHPKFRHNRKFYIYYSCLDKKKVEKIRISEMKVSRAD 
PNKADLKSERVILEIEEPASNHNGGQLLFGLDGYMYIFTGDGGQAGDPFGLFGNAQNKSSLLG 
KVLRIDVNRAGSHGKRYRVPSDNPFVSEPGAHPAIYAYGIRNMWRCAVDRGDPITRQGRGRIF 
CGDVGQNRFEEVDLILKGGNYGWRAKEGFACYDKKLCHNASLDDVLPIYAYGHAVGKSVTGGY 
VYRGCESPNLNGLYIFGDFMSGRLMALQEDRKNKKWKKQDLCLGSTTSCAFPGLISTHSKFII 
SFAEDEAGELYFLATSYPSAYAPRGSIYKFVDPSRRAPPGKCKYKPVPVRTKSKRIPFRPLAK 
TVLDLLKEQSEKAARKSSSATLASGPAQGLSEKGSSKKLASPTSSKNTLRGPGTKKKARVGPH 
VRQGKRRKSLKSHSGRMRPSAEQKRAGRSLP 

Important features of the protein: 
Signal peptide: 

amino acids 1-41 
Transmembrane domain: 

amino acids 17-36 
N-glycosylation sites. 

amino acids 372-376, 480-484 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 645-649, 699-703 
Tyrosine kinase phosphorylation site.

amino acids 81-89

N-myristoylation sites.

amino acids 11-17, 37-43, 156-162, 165-171, 357-363, 365-371,
368-374, 408-414, 459-465, 548-554, 557-563

Amidation sites. 

amino acids 391-395, 696-700 

Cell attachment sequence. 

amino acids 428-431 

Leucine zipper pattern. 

amino acids 25-47 
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FIGURE 61

CTCCATTAAACCACCACCAGCTCCCCAAGCCACCCCTTCAGCCATGAAGTTCCTGCTCCTGGT
CTTGGCAGCCCTCGGATTCCTGACCCAGGTGATCCCAGCCAGTGCAGGTGGGTCAAAATGTGT 
GAGTAACACCCCAGGATACTGCAGGACATGTTGCCACTGGGGGGAGACAGCATTGTTCATGTG 
CAACGCTTCCAGAAAATGCTGCATCAGCTACTCCTTCCTGCCGAAGCCTGACCTACCACAGCT 
CATCGGTAACCACTGGCAATCAAGGAGAAGAAACACACAAAGGAAAGACAAGAAGCAACAAAC 
GACCGTAACATCATAATAACCACTGCTATCGCCTCCACCAACTCAGAGAAATATCATTTCCAC 
AGTTCCAATTCCTCCTACATTGCTGAGTACTAGCCAAGGCTCCTCTTTATGGGGCAGATATCT 
ATAGCCAACCCCAAAACTTCTGTCTTCTATCATTCTGTCATTCATCTAGTAACTAATTTGGAG 
TTTGTATCTATCTTACGAGAACAATCATCATGCAGATTCGTCCACAGGGGATCTGTCAGTTTG 
GGTCCTCCAAATGAAAAATGTCAAGACAGAATTGGACATGCAAAAGATTGACTGGGAGAACAC 
ACCTCTGATGGACAAAGGTGAGACAGAGCAGCCACAGGCAGGGAGAGCCTTCAGACTGCAACG 
CTGGCCTGATACGTGTCAAAGGAGAGAGGGATAGAGGAGGATTGAATAGAAGGAGACTAAGAC 
TGCAGCTCTAAGAAAGTCTCAGCCAAACAGATGGGGAGGCCCAAAGCAAGGCTTGCCCCTCAG 
AGGAGCTCACGCAGGGCAGGAATAGCCAGGTTCTCATATCCCAGGGGTTCAGACTTGGCTGAG 
AACAGCCCCTGGAGAACATGGGGTGACTGCTACCATAGGTCTGGAAGTATGAGGCTGTCCACC 
AACTATCCCCTTGAAGCAAGTTCTCTTGAAAGGAAATCTAAACAGTGCACCCCCATGGCTGCC 
ACGGAGTATAAGGAGGGAGAGAAAGGAGCTGAAAGTCTAGGTTTGGCCAGCTAGGTAGACTGA 
CTTGTGAGGTATTTATTTATTCATTTGAGTAACAAAGCAGACAGAATACATAGCCACCATTGG 
TAGTACACCCCAAAAGCAAGGATGGCATGATGCTGGTGACTCAAACGTGCCTACTCATGGTGT 
CAAATTGGCATAATCCTCTTGGGAAGCTGTGTGGAAATAAGCACAGAGAAGCAGAACTCTAAT 
TGCTTAATCCACTAAACATTACTTCTGGGAATTGGCTCATCATAAATTATCCAAGAGAAAGCA 
CAAAGTTATGGGCACAAAGGTTTTCCATATAATATTATTTAAAATGCTGAGAAAATGAAAAAA 
TCTAAATGGTGAAATATATACTAATGCCATCTATAAATACAAACAAATAGAATGTTTATAGAA 
TAATGGAACATAATAACATTATTCAAAATTGCATTTATGCTATAGTTGTCAAAATTGTCTCCT 
TATATGATACAAAACTCATGAAAATTATGACTTTTTTGTTTGGTTGGAAAGCAGAATTATGCA 
TAAATTTCCTCTTACAGTTCGATGCCCATTAGTTTTATATAACATTTATTTGACACGTACTGA 
CTTCTATCTGAGAAGAACAAACCAAAACACTCAGGCCTAAATAATTAAAAACGGTCCTAAAAA 
CTAGCAAACCAGATAAGAAAAGATGTTAATGCCCATTCCCTAACTTATGTCTTAGACCAAAAT 
TAATTCTAGATGGTTTTAAAATGACAGTGTAAAAGTAAAGTATTAAAAGATTGTGTGGTCAAA 
TATTCAATTTAAGAGCAAGGAT^ATTCTTATAAATATAACAATAGAGGCAGAACTCATGTAAGA 
ATAAATTGATTAGGTGGTATTAAATATTAAGTTCTTATGTATGTCAAAAGATATCATTTTGAA 
ATTCATCCATCTTATTGGGTATTGCAGGAGTTCATTCCTTTTTGTTTATAAATACTCTTCCGT 
CATATGAATAGTATTCATTTGTATACTGGTTTGTTGATGGACATTTGGGTTGTTCCCAGTTTA 
TGGCTATTACAAATAAAGCTTCTATGAACATTTATGTACA 
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FIGURE 62 

MKFLLLVLAALGFLTQVIPASAGGSKCVSNTPGYCRTCCHWGETALFMCNASRKCCISYSFLP
KPDLPQLIGNHWQSRRRNTQRKDKKQQTTVTS 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 1-22 

N-glycosylation site. 

amino acids 50-53 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 79-82 

N-myristoylation site. 

amino acids 23-28 
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FIGURE 63 

GCGGAGCGCCTGGGAGAGGAGAAGGAGCCGACCTGCCGAGATGGAGGCGACCGGCACCTGGGC 
GCTGCTGCTGGCGCTGGCGCTGCTCCTGCTGCTGACGCTGGCGCTGTCCGGGACCAGGGCCCG 
AGGCCACCTGCCCCCCGGGCCCACGCCGCTACCACTGCTGGGAAACCTCCTGCAGCTACGGCC 
CGGGGCGCTGTATTCAGGGCTCATGCGGCTGAGTAAGAAGTACGGACCGGTGTTCACCATCTA 
CCTGGGACCCTGGCGGCCTGTGGTGGTCCTGGTTGGGCAGGAGGCTGTGCGGGAGGCCCTGGG 
AGGTCAGGCTGAGGAGTTCAGCGGCCGGGGAACCGTAGCGATGCTGGAAGGGACTTTTGATGG 
CCATGGGGTTTTCTTCTCCAACGGGGAGCGGTGGAGGCAGCTGAGGAAGTTTACCATGCTTGC 
TCTGCGGGACCTGGGCATGGGGAAGCGAGAAGGCGAGGAGCTGATCCAGGCGGAGGCCCGGTG 
TCTGGTGGAGACATTCCAGGGGACAGAAGGACGCCCATTCGATCCCTCCCTGCTGCTGGCCCA 
GGCCACCTCCAACGTAGTCTGCTCCCTCCTCTTTGGCCTCCGCTTCTCCTATGAGGATAAGGA 
GTTCCAGGCCGTGGTCCGGGCAGCTGGTGGTACCCTGCTGGGAGTCAGCTCCCAGGGGGGTCA 
GACCTACGAGATGTTCTCCTGGTTCCTGCGGCCCCTGCCAGGCCCCCACAAGCAGCTCCTCCA 
CCACGTCAGCACCTTGGCTGCCTTCACAGTCCGGCAGGTGCAGCAGCACCAGGGGAACCTGGA 
TGCTTCGGGCCCCGCACGTGACCTTGTCGATGCCTTCCTGCTGAAGATGGCACAGGAGGAACA 
AAACCCAGGCACAGAATTCACCAACAAGAACATGCTGATGACAGTCATTTATTTGCTGTTTGC 
TGGGACGATGACGGTCAGCACCACGGTCGGCTATACCCTCCTGCTCCTGATGAAATACCCTCA 
TGTCCAAAAGTGGGTACGTGAGGAGCTGAATCGGGAGCTGGGGGCTGGCCAGGCACCAAGCCT 
AGGGGACCGTACCCGCCTCCCTTACACCGACGCGGTTCTGCATGAGGCGCAGCGGCTGCTGGC 
GCTGGTGCCCATGGGAATACCCCGCAGCCTCATGCGGACCACCCGCTTCCGAGGGTACACCCT 
GCCCCAGGGCACGGAGGTCTTCCCCCTCCTTGGCTCCATCCTGCATGACCCCAACATCTTCAA 
GCACCCAGAAGAGTTCAACCCAGACCGTTTCCTGGATGCAGATGGACGGTTCAGGAAGCATGA 
GGCGTTCCTGCCCTTCTCCTTAGGGAAGCGTGTCTGCCTTGGAGAGGGCCTGGCAAAAGCGGA 
GCTCTTCCTCTTCTTCACCACCATCCTACAAGCCTTCTCCCTGGAGAGCCCGTGCCCGCCGGA 
CACCCTGAGCCTCAAGCCCACCGTCAGTGGCCTTTTCAACATTCCCCCAGCCTTCCAGCTGCA 
AGTCCGTCCCACTGACCTTCACTCCACCACGCAGACCAGATGAAGGAAGGCAACTTGGAAGTG 
GTGGGTGCCCAGGACGGTGCCTCCAGCCTCAACAGTGGGCATGGACAGGGTTAATGTCTCCAG 
AGTGTACACTGCAGGCAGCCACATTTACACGCCTGCAGTTGTTTTCCGGAGTCTGTCCCACGG 
CCCACACGCTCACTTGACTCATGCTGCTAAGATGCACAACCGCACACCCATACACAACTACAA 
GGGCCACAAAGCAACTGCTGGGTTAGCTTTCCACAGACATAAATATAGTCCATCTGCAATCAC 
AAGCACATAGCCAGGTAACCCACCAACTCCCCTGGATCTGCAGCCCACACGTGGGAGTCTGGC 
TGTCACCTTCACAAGCCACAGAAACGGCCACACATGTTCACAGCTCACACGCCCTCTCCATTC 
ATCGAACTTCTCAGTGTCCCTGTCCCTGGTGCCTGGCACAGGGAACAGCATGCCCCCTCCGGG 
GTCATGCCACCCAGAGACTGTCGCTGTCTATGGCCCCAACTCATGCTCCCTCTCTTGGCTACA 
CCACTCTCCCAGCCTGTGACCACCGATGTCCACACACCCCCAACCACTTGTCCACACAGCTAC 
CCACGTACAACATCGTCCTGGCTCCCCAGAGTATCTTCCCACTGAGACACGCCGCCCCCACAG 
AGGCACAGTCCCCAGCCACCTCTGCAACTGCAGCCCTCAGTCACCCCTTTTTAAGCACCCTGA 
TTCTACCAAATGCAAACACATCTGGGTCTGCGATTATGCACAGAGACTTTGGACATACGAGGA 
CCCTCAGACCGGAGGAACACCTGCCCAACCCCAACACGTGCTTATGTAACCACGTGGAAAGCG 
GCCCCTGCTGCCCCTCCACACACACATACACACTCACTGATCTACAGCCCCTGTTCGGCGTCA 
GAGTCCCCACTAGACCCAGTGGAAGGGGTTAGAGACCAAGTAGGGGCCAGTTTCCAATTCACC 
CTGTCAGGGAGTGAGCCGGATCTGACGTTCCTTGTGACTTAAGGGTCCGGCTTGGGAATTAAA 
GTTTGTTTCTGGCCTTTAGCCTAAAAAAAAAAAAAAAAAA 
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FIGURE 64 

MEATGTWALLLALALLLLLTLALSGTRARGHLPPGPTPLPLLGNLLQLRPGALYSGLMRLSKK 
YGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGHGVFFSNGERWRQ
LRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTEGRPFDPSLLLAQATSNVVCSLLFGL 
RFSYEDKEFQAVVRAAGGTLLGVSSQGGQTYEMFSWFLRPLPGPHKQLLHHVSTLAAFTVRQV 
QQHQGNLDASGPARDLVDAFLLKMAQEEQNPGTEFTNKNMLMTVIYLLFAGTMTVSTTVGYTL 
LLLMKYPHVQKWVREELNRELGAGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRT 
TRFRGYTLPQGTEVFPLLGSILHDPNIFKHPEEFNPDRFLDADGRFRKHEAFLPFSLGKRVCL 
GEGLAKAELFLFFTTILQAFSLESPCPPDTLSLKPTVSGLFNIPPAFQLQVRPTDLHSTTQTR 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 294-313 

Glycosaminoglycan attachment site. 

amino acids 99-103 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 128-132 

N-myristoylation sites. 

amino acids 51-57, 109-115, 115-121, 188-194, 207-213, 257-263, 
284-290, 339-345, 370-376, 444-450 

Amidation sites. 

amino acids 140-144, 435-439

Leucine zipper pattern. 

amino acids 32-54, 39-61 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 433-443 
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FIGURE 65 

CGGACGCGTGGGGCCGTATGCGCGGCTCTGTGGAGTGCACGTGGGGTTGGGGGCACTGTGCCC 
CCAGCCCCCTGCTCCTTTGGACTCTACTTCTGTTTGCAGCCCCATTTGGCCTGCTGGGGGAGA 
AGACCCGCCAGGTGTCTCTGGAGGTCATCCCTAACTGGCTGGGCCCCCTGCAGAACCTGCTTC 
ATATACGGGCAGTGGGCACCAATTCCACACTGCACTATGTGTGGAGCAGCCTGGGGCCTCTGG 
CAGTGGTAATGGTGGCCACCAACACCCCCCACAGCACCCTGAGCATCAACTGGAGCCTCCTGC 
TATCCCCTGAGCCCGATGGGGGCCTGATGGTGCTCCCTAAGGACAGCATTCAGTTTTCTTCTG 
CCCTTGTTTTTACCAGGCTGCTTGAGTTTGACAGCACCAACGTGTCCGATACGGCAGCAAAGC 
CTTTGGGAAGACCATATCCTCCATACTCCTTGGCCGATTTCTCTTGGAACAACATCACTGATT 
CATTGGATCCTGCCACCCTGAGTGCCACATTTCAAGGCCACCCCATGAACGACCCTACCAGGA 
CTTTTGCCAATGGCAGCCTGGCCTTCAGGGTCCAGGCCTTTTCCAGGTCCAGCCGACCAGCCC 
AACCCCCTCGCCTCCTGCACACAGCAGACACCTGTCAGCTAGAGGTGGCCCTGATTGGAGCCT 
CTCCCCGGGGAAACCGTTCCCTGTTTGGGCTGGAGGTAGCCACATTGGGCCAGGGCCCTGACT 
GCCCCTCAATGCAGGAGCAGCACTCCATCGACGATGAATATGCACCGGCCGTCTTCCAGTTGG 
ACCAGCTACTGTGGGGCTCCCTCCCATCAGGCTTTGCACAGTGGCGACCAGTGGCTTACTCCC 
AGAAGCCGGGGGGCCGAGAATCAGCCCTGCCCTGCCAAGCTTCCCCTCTTCATCCTGCCTTAG 
CATACTCTCTTCCCCAGTCACCCATTGTCCGAGCCTTCTTTGGGTCCCAGAATAACTTCTGTG 
CCTTCAATCTGACGTTCGGGGCTTCCACAGGCCCTGGCTATTGGGACCAACACTACCTCAGCT 
GGTCGATGCTCCTGGGTGTGGGCTTCCCTCCAGTGGACGGCTTGTCCCCACTAGTCCTGGGCA 
TCATGGCAGTGGCCCTGGGTGCCCCAGGGCTCATGCTGCTAGGGGGCGGCTTGGTTCTGCTGC 
TGCACCACAAGAAGTACTCAGAGTACCAGTCCATAAATTAAGGCCCGCTCTCTGGAGGGAAGG 
ACATTACTGAACCTGTCTTGCTGTGCCTCGAAACTCTGGAGGTTGGAGCATCAAGTTCCAGCC 
GGCCCCTTCACTCCCCCATCTTGCTTTTCTGTGGAACCTCAGAGGCCAGCCTCGACTTCCTGG 
AGACCCCCAGGTGGGGCTTCCTTCATACTTTGTTGGGGGACTTTGGAGGCGGGCAGGGGACAG 
GGCTATTGATAAGGTCGCCTTGGTGTTGCCTTCTTGCATCTCCACACATTTCCCTTGGATGGG 
ACTTGCAGGCCTAAATGAGAGGCATTCTGACTGGTTGGCTGCCCTGGAAGGCAAGAAAATAGA 
TTTATTTTTTTTCACAGGGAAAAAAAT^AAAA 
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FIGURE 66 

MRGSVECTWGWGHCAPSPLLLWTLLLFAAPFGLLGEKTRQVSLEVIPNWLGPLQNLLHIRAVG 
TNSTLHYVWSSLGPLAVVMVATNTPHSTLSINWSLLLSPEPDGGLMVLPKDSIQFSSALVFTR 
LLEFDSTNVSDTAAKPLGRPYPPYSLADFSWNNITDSLDPATLSATFQGHPMNDPTRTFANGS
LAFRVQAFSRSSRPAQPPRLLHTADTCQLEVALIGASPRGNRSLFGLEVATLGQGPDCPSMQE 
QHSIDDEYAPAVFQLDQLLWGSLPSGFAQWRPVAYSQKPGGRESALPCQASPLHPALAYSLPQ 
SPIVRAFFGSQNNFCAFNLTFGASTGPGYWDQHYLSWSMLLGVGFPPVDGLSPLVLGIMAVAL 
GAPGLMLLGGGLVLLLHHKKYSEYQSIN

N-glycosylation sites: 

amino acids 65-69, 95-99, 134-138, 159-163, 187-191, 230-234, 
333-337 

cAMP- and cGMP-dependent protein kinase phosphorylation site:

amino acids 397-401 

Casein kinase II phosphorylation sites: 

amino acids 151-155, 249-253, 255-259 

N-myristoylation sites: 

amino acids 3-9, 63-69, 235-241, 273-279, 292-298, 324-330 

Leucine zipper pattern. 

amino acids 371-393 
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FIGURE 67 

CGGGACAGGCGCGTGAGGCCACAACACATGCGTGTATCTTGCTTGGGCTATCTTCCCTGCTCTGCCACGCCGGGT 
CTGGAGAAGGGGTTTCAGCCCCAGGACATTTACTGAGAGTCGGCGAATATTGGGAGCCGCGATGTTCCCCCTTCG 
GGCCCTGTGGTTGGTCTGGGCGCTTCTAGGAGTGGCCGGATCATGCCCGGAGCCGTGCGCCTGCGTGGACAAGTA 
CGCTCAGCAGTTCGCGGACTGCGCTTACAAAGAGTTGCGTGAGGTGCCGGAAGGACTGCCTGCCAACGTGACGAC 
GCTTAGTCTGTCCGCGAACAAGATCACTGTGCTGCGGCGCGGGGCCTTCGCCGACGTCACACAGGTCACGTCGCT 
GTGGCTGGCGCACAATGAGGTGCGCACCGTGGAGCCAGGCGCACTGGCCGTGCTGAGTCAGCTCAAGAACCTCGA 
TCTGAGCCACAACTTCATATCCAGCTTTCCGTGGAGCGACCTGCGCAACCTGAGCGCGCTGCAGCTGCTCAAAAT 
GAACCACAACCGCCTGGGCTCTCTGCCCCGGGACGCACTCGGTGCGCTACCCGACCTGCGTTCCCTGCGCATCAA 
CAACAACCGGCTGCGTACGCTGGCGCCTGGCACCTTCGACGCGCTTAGCGCGCTGTCACACTTGCAACTCTATCA 
CAATCCCTTCCACTGCGGCTGCGGCCTTGTGTGGCTGCAGGCCTGGGCCGCGAGCACCCGGGTGTCCTTACCCGA 
GCCCGACTCCATTGCTTGTGCCTCGCCTCCCGCGCTGCAGGGGGTGCCGGTGTACCGCCTGCCCGCCCTGCCCTG 
TGCACCGCCCAGCGTGCATCTGAGTGCCGAGCCACCGCTTGAAGCACCCGGCACCCCACTGCGCGCAGGACTGGC 
GTTCGTGTTACACTGCATCGCCGACGGCCACCCTACGCCTCGCCTGCAATGGCAACTTCAGATCCCCGGTGGCAC 
CGTAGTCTTAGAGCCACCGGTTCTGAGCGGGGAGGACGACGGGGTTGGGGCGGAGGAAGGAGAGGGAGAAGGAGA 
TGGGGATTTGCTGACGCAGACCCAAGCCCAAACGCCGACTCCAGCACCCGCTTGGCCGGCGCCCCCAGCCACACC 
GCGCTTCCTGGCCCTCGCAAATGGCTCCCTGTTGGTGCCCCTCCTGAGTGCCAAGGAGGCGGGCGTCTACACTTG
CCGTGCACACAATGAGCTGGGCGCCAACTCTACGTCAATACGCGTGGCGGTGGCAGCAACCGGGCCCCCAAAACA 
CGCGCCTGGCGCCGGGGGAGAACCCGACGGACAGGCCCCGACCTCTGAGCGCAAGTCCACAGCCAAGGGCCGGGG 
CAACAGCGTCCTGCCTTCCAAACCCGAGGGCAAAATCAAAGGCCAAGGCCTGGCCAAGGTCAGCATTCTCGGGGA 
GACCGAGACGGAGCCGGAGGAGGACACAAGTGAGGGAGAGGAGGCCGAAGACCAGATCCTCGCGGACCCGGCGGA 
GGAGCAGCGCTGTGGCAACGGGGACCCCTCTCGGTACGTTTCTAACCACGCGTTCAACCAGAGCGCAGAGCTCAA 
GCCGCACGTCTTCGAGCTGGGCGTCATCGCGCTGGATGTGGCGGAGCGCGAGGCGCGGGTGCAGCTGACTCCGCT 
GGCTGCGCGCTGGGGCCCTGGGCCCGGCGGGGCTGGCGGAGCCCCGCGACCCGGGCGGCGACCCCTGCGCCTACT 
CTATCTGTGTCCAGCGGGGGGCGGCGCGGCAGTGCAGTGGTCCCGCGTAGAGGAAGGCGTCAACGCCTACTGGTT 
CCGCGGCCTGCGGCCGGGTACCAACTACTCCGTGTGCCTGGCGCTGGCGGGCGAAGCCTGCCACGTGCAAGTGGT 
GTTTTCCACCAAGAAGGAGCTCCCATCGCTGCTGGTCATAGTGGCAGTGAGCGTATTCCTCCTGGTGCTGGCCAC 
AGTGCCCCTTCTGGGCGCCGCCTGCTGCCATCTGCTGGCTAAACACCCGGGCAAGCCCTACCGTCTGATCCTGCG 
GCCTCAGGCCCCTGACCCTATGGAGAAGCGCATCGCCGCAGACTTCGACCCGCGTGCTTCGTACCTCGAGTCCGA 
GAAAAGCTACCCGGCAGGCGGCGAGGCGGGCGGCGAGGAGCCAGAGGACGTGCAGGGGGAGGGCCTTGATGAAGA 
CGCGGAGCAGGGAGACCCAAGTGGGGACCTGCAGAGAGAGGAGAGCCTGGCGGCCTGCTCACTGGTGGAGTCCCA 
GTCCAAGGCCAACCAAGAGGAGTTCGAGGCGGGCTCTGAGTACAGCGATCGGCTGCCCCTGGGCGCCGAGGCGGT 
CAACATCGCCCAGGAGATTAATGGCAACTACAGGCAGACGGCAGGCTGAACCTCCGCCCGTCCGGCCCGCCCATT 
CCCGACCTCCACCTAGGGTGCCTGGGAGCAGCAGTCTAGGGCTGGCAGGACTTATGTCCCCCGTCCCCAACCTTC 
ACCTACTCCTCCCCCTTACTACTCCCCAACCTTGACTACCAGGGACTTCTATTAGGGAGTGGGCCGATTTCACCA 
GTCCCTGCTACCCACGGCTGCCATTCTCCCTGCGGGCTGAATCCCCTTCCCCGCCAAGCACAGTGTTTATCTTAC 
CCCATGCAAGACTCCACCCGCAGACGGTGGGCGATATCTATGTCCCTCCATTCCCGTCGCGATTATCTGCGAAAT 
CCACCCCGCAGCCCGCCCCACCGTGGGCTCTGGAGCCAGAGGAAACGAGCGAAGACTTTGGAAACCTCGCGGTAA 
CGCGGTGGTTTCGGGGGCCAGCCAAGGCCAGTGGAGTGCTGTGGGGTCCCACCTCGACCCCTCCTCCTCCCTTTC 
TTTCTTTCCTTTTTTTTTATTTTTTAATTTTATTTATTTATTTATTTATTTTTTGACGGAGTCTTGGTCTGTCGC 
CAGGCTGGAGTGCAGTGGCGCGATCTCGGCTCACTGCATCTTCCGCCTCCCGGGTTCAAGCGATTCTCCTGCCTC 
AGCCTGCCTAGTAGCTGGGACTACAGGCGCGCGCCACCACGACCAGCTAATTTCTTCTATTTTTAGTAGAGACGG 
GGTTTCACCATGTTGGCCAGGATGGTCTGGATCTCTTGACCTCAGGTGATCCATCTGCCTCGGCCTCTCAAAGTG 
CTGGGATTACAGGCGTGAGGCACCGCGCCCGGCCCCTCCTCCCTTTCAATCCCTACTCCCAGAAGCCGGGATTCG 
TGGCAACCCCTAGTTTTTAGTTCCAAAGCCTCCTGCCGGCAGGGAACCAAATCCTTCTGTCCTCCCACCCCCACC 
GCACTTCTGGCCAGTTGGAGTCCAGCCCGGTGCCTGGGGCGCCTTTCAGCTCCGCGCTCAGATTTTCCTGTTTTC 
GTTGTTTTCAAAGACAGCGACATTTCGGGTCTGGTGCTAACACCCCCTTCCCAGCCTCTGGGAAAATGGAGTGTG 
TGTGTCGGGGGGTAGGGAGGGAATGCGTTTTCTGTCGTCTCTCTCCTAACTTAAAGCGCCGCAGGACCGCGCGCC 
CCTTGGCGGCTGAGCCTGTGGACTTGGTCGCGGGCCAATTTCGTTGTCCGTGTGTTGGGCTTTCCGGAGGTCTGT 
GCGCCCAACAGCGCCGCTCCCGCGGCTCCACCCGACCCAGACCCTAGCTGGAAAGCGCCGGAGGCGGAGGAAGCT 
GACTGTGGCCTCCCGGGCCGCGGCTCTCTGGAGGGCTCGCGCCCTAGTTCGCACAAAGCCTGCTCGTGACTGTGC 
GACTGTGCGACGGGATCCGGATGGAGCCGAGCCCCTCCGTCCTCGCGTCTCGGTCCTCGCGTCGCCCCGCCCCAC 
CCGCCCCTGCTTCGGCGGGAATCGTGTTTGCCCGGCGTGTAGTCCCTGACAAGCGTGCCCTGTAGGAGAAAAGTC 
TGTGTCCTGTGAAGTGTGACCGTGTAGTGTAGGGGGGCGGGCGGGGGGGCGGATGGGCGGGGAGGGAGGGAAGGG 
GAGGGGCGCGGCGCGGCGACTCGGGGCGGGGTTCTTTTTTCCATTTTGAAAGAAAGCGTCGGGGTTGGGGTGGGG 
GGAGTTTCAGTCCTCGGGATCAGCCCTCTCCGCGAAGCGCAGCACAAGCGCGGGCCTGGGACGGAGTAGCCCCCC 
GGAGCCCGTGCCCTTTTCTAAACGCGTCTGTATGCAGTCAATAAAACAATCGATTTGAAA
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FIGURE 68 

MFPLRALWLVWALLGVAGSCPEPCACVDKYAHQFADCAYKELREVPEGLPANVTTLSLSANKI 
TVLRRGAFADVTQVTSLWLAHNEVRTVEPGALAVLSQLKNLDLSHNFISSFPWSDLRNLSALQ 
LLKMNHNRLGSLPRDALGALPDLRSLRINNNRLRTLAPGTFDALSALSHLQLYHNPFHCGCGL 
VWLQAWAASTRVSLPEPDSIACASPPALQGVPVYRLPALPCAPPSVHLSAEPPLEAPGTPLRA 
GLAFVLHCIADGHPTPRLQWQLQIPGGTVVLEPPVLSGEDDGVGAEEGEGEGDGDLLTQTQAQ 
TPTPAPAWPAPPATPRFLALANGSLLVPLLSAKEAGVYTCRAHNELGANSTSIRVAVAATGPP 
KHAPGAGGEPDGQAPTSERKSTAKGRGNSVLPSKPEGKIKGQGLAKVSILGETETEPEEDTSE 
GEEAEDQILADPAEEQRCGNGDPSRYVSNHAFNQSAELKPHVFELGVIALDVAEREARVQLTP 
LAARWGPGPGGAGGAPRPGRRPLRLLYLCPAGGGAAVQWSRVEEGVNAYWFRGLRPGTNYSVC 
LALAGEACHVQVVFSTKKELPSLLVIVAVSVFLLVLATVPLLGAACCHLLAKHPGKPYRLILR
PQAPDPMEKRIAADFDPRASYLESEKSYPAGGEAGGEEPEDVQGEGLDEDAEQGDPSGDLQRE 
ESLAACSLVESQSKANQEEFEAGSEYSDRLPLGAEAVNIAQEINGNYRQTAG 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 
Transmembrane domain:

amino acids 587-610

N-glycosylation sites.

amino acids 52-55, 121-124, 337-340, 364-367, 474-477, 563-566

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 397-400

Casein kinase II phosphorylation sites.

amino acids 19-23, 202-205, 289-292, 246-249, 411-414, 431-434,
433-436, 440-443, 544-547, 583-586, 650-653, 700-703

N-myristoylation sites.

amino acids 15-20, 48-53, 165-170, 296-301, 351-356, 362-367,
390-395, 419-424, 514-519, 536-541, 557-562, 561-566, 610-615,
661-666, 716-721

Amidation site.

amino acids 522-525

Prokaryotic membrane lipoprotein lipid attachment sites.

amino acids 10-20, 603-613
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FIGURE 69 

GGCGGCGGGAGCAGCGAAGGGGGCGGCAGGGATCCTCCAGGCTGCCGGCTGGGAAGGCGTGGG 
CGACCCGGTGTGTGGCGCGCCCAGAGCCCCGCGTTTCAGCCCTAGGGAAGGAAGCCAGTTGAG 
GGAAGTTCTCCATGAATGTACGTCACAATGATGATGACCGACCAAATCCCTCTGGAACTGCCA 
CCATTGCTGAACGGAGAGGTAGCCATGATGCCCCACTTGGTGAATGGAGATGCAGCTCAGCAT 
GTTATTCTCGTTCAAGTTAATCCAGGTGAGACTTTCACAATAAGAGCAGAGGATGGAACACTT 
CAGTGCATTCAAGGACCTGCTGAAGTTCCCATGATGTCACCCAATGGATCCATTCCTCCCATT 
CATGTGCCTCCAGGTTATATCTCACAGGTGATTGAAGATAGTACTGGAGTCCGCCGGGTGGTG 
GTCACACCCCAGTCTCCTGAGTGTTATCCCCCAAGCTACCCCTCAGCCATGTCTCCAACCCAT 
CATCTCCCTCCCTATCTGACTCACCATCCACATTTTATTCATAACTCACACACGGCTTACTAC 
CCACCTGTTACCGGACCTGGAGATATGCCGCCTCAGTTTTTTCCCCAGCATCATCTTCCCCAC 
ACAATATATGGTGAGCAAGAAATTATACCATTTTATGGAATGTCAAGCTACATCACCCGAGAA 
GACCAGTACAGCAAGCCTCCGCACAAAAAACTGAAAGACCGCCAGATCGATCGCCAGAACCGC 
CTCAACAGCCCTCCTTCTTCTATCTACAAAAGCAGCTGCACAACAGTATACAATGGCTATGGG 
AAGGGCCATAGTGGTGGAAGTGGCGGAGGCGGCAGCGGTAGTGGTCCCGGAATTAAGAAAACA 
GAGCGACGAGCAAGAAGCAGCCCAAAGTCGAATGATTCAGACTTGCAAGAATATGAGTTGGAA 
GTAAAGAGGGTGCAAGACATTCTTTCGGGAATAGAGAAACCACAGGTTTCTAATATTCAGGCA 
AGAGCAGTTGTGTTGTCCTGGGCTCCCCCTGTTGGACTTTCCTGTGGACCCCACAGTGGTCTT 
TCCTTCCCCTACAGTTACGAGGTGGCCTTATCAGACAAAGGACGAGATGGAAAATACAAGATA 
ATTTACAGTGGAGAAGAATTAGAATGTAACCTGAAAGATCTTAGACCAGCAACAGATTATCAT 
GTGAGGGTGTATGCCATGTACAATTCCGTAAAGGGATCCTGCTCCGAGCCTGTTAGCTTCACC 
ACCCACAGCTGTGCACCCGAGTGTCCTTTCCCCCCTAAGCTGGCACATAGGAGCAAAAGTTCA 
CTAACCCTGCAGTGGAAGGCACCAATTGACAACGGTTCAAAAATCACCAACTACCTTTTAGAG 
TGGGATGAGGGAAAAAGAAATAGTGGTTTCAGACAGTGCTTCTTCGGGAGCCAGAAGCACTGC 
AAGTTGACAAAGCTTTGTCCGGCAATGGGGTACACATTCAGGCTGGCCGCTCGAAACGACATT 
GGCACCAGTGGTTATAGCCAAGAGGTGGTGTGCTACACATTAGGAAATATCCCTCAGATGCCT 
TCTGCACCAAGGCTGGTTCGAGCTGGCATCACATGGGTCACGTTGCAGTGGAGTAAGCCAGAA 
GGCTGTTCACCCGAGGAAGTGATCACCTACACCTTGGAAATTCAGGAGGATGAAAATGATAAC 
CTTTTCCACCCAAAATACACTGGAGAGGATTTAACCTGTACTGTGAAAAATCTCAAAAGAAGC 
ACACAGTATAAATTCAGGCTGACTGCTTCTAATACGGAAGGAAAAAGCTGTCCAAGCGAAGTT 
CTTGTTTGTACGACGAGTCCTGACAGGCCTGGACCTCCTACCAGACCGCTTGTCAAAGGCCCA 
GTTACATCTCATGGCTTTAGTGTCAAATGGGATCCCCCTAAGGACAATGGTGGTTCAGAAATC 
CTCAAGTACTTGCTAGAGATTACTGATGGAAATTCTGAAGGTGAAGTTTTTGGCAATTGTTTT 
ATTCAAATCCAATAGCAAGCTCTGTTTTCTAATATAGTAAATGTCTTTATAGTAATAGTGAGT
AATCATTAATTCTAAAGATAGAATTATTATTACAATAAACAAACTTTAGTCACATATTGGCAG 
TTTTTCTATTTCAAACACAGCACCAGAGATCAGAGTCTACTTGAAACTTACATTTGTGTTATT
TAACAATTTTTCTGTATCTTTTTCATTGGTGTTTTGTTTTGTTTATCTTTTGTTTTTGTTTCT 
TTGGTTTGGTTTGTTTTTGTTTTGTTTTTTGAGATACGATCTCTGTCACACAGGCTGGAGGGC 
AGTGGCACAGACATGGCCCATTGCAGTCTCAGACTCCTGGGCTTAAGTGACTCTTCTGCCACA
GAAGATGAGGAAGAATACATTTTTCATAGTGATGGGGTCTCACTATGTTATCTAGGCTGGTCT 
CAAACTCCTGGCCTCAAGCAACCCTCCACCTTGGCCTCCCAAAGTGCTGGGACTATAGACATG 
AATCACCACACTCAGCTTCCATGTCTTTTTATGAACTAGGGTTCCTAATTAATCAGATAAATT 
TGGTATTTTCATCTCCTAACTTGCCATATGTTTTCTGGAAATTCTTATAAGCAGCCGAGAGTG 
GTGGCTCACGCTGTAGTCCCAGCACTTTGGGAGGCTGAGGTGGGTGGTCAGGAGATCAAGACC 
ATCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGGTG 
GCAGGCACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGAAGAATTGCTTGAACCCAGCAG 
GCGGAGGTTGCAGTGAGCTGAGATTGCACCACTGCACTCCAGCCTGGTGACAGAGTGAGACTC 
TGTCTCAAAAAAAAAAAA 
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FIGURE 70 

MMMTDQIPLELPPLLNGEVAMMPHLVNGDAAQHVILVQVNPGETFTIRAEDGTLQCIQGPAEV 
PMMSPNGSIPPIHVPPGYISQVIEDSTGVRRVVVTPQSPECYPPSYPSAMSPTHHLPPYLTHH
PHFIHNSHTAYYPPVTGPGDMPPQFFPQHHLPHTIYGEQEIIPFYGMSSYITREDQYSKPPHK 
KLKDRQIDRQNRLNSPPSSIYKSSCTTVYNGYGKGHSGGSGGGGSGSGPGIKKTERRARSSPK 
SNDSDLQEYELEVKRVQDILSGIEKPQVSNIQARAVVLSWAPPVGLSCGPHSGLSFPYSYEVA
LSDKGRDGKYKIIYSGEELECNLKDLRPATDYHVRVYAMYNSVKGSCSEPVSFTTHSCAPECP 
FPPKLAHRSKSSLTLQWKAPIDNGSKITNYLLEWDEGKRNSGFRQCFFGSQKHCKLTKLCPAM 
GYTFRLAARNDIGTSGYSQEVVCYTLGNIPQMPSAPRLVRAGITWVTLQWSKPEGCSPEEVIT 
YTLEIQEDENDNLFHPKYTGEDLTCTVKNLKRSTQYKFRLTASNTEGKSCPSEVLVCTTSPDR 
PGPPTRPLVKGPVTSHGFSVKWDPPKDNGGSEILKYLLEITDGNSEGEVFGNCFIQIQ 

Important features of the protein: 
N-glycosylation sites.

amino acids 69-73, 254-258, 401-405

Glycosaminoglycan attachment sites.

amino acids 229-233, 234-238, 236-240

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 416-420, 535-539

Tyrosine kinase phosphorylation site.

amino acids 319-326

N-myristoylation sites.

amino acids 52-58, 227-233, 228-234, 230-236, 231-237, 232-238,
235-241, 239-245, 402-408, 610-616

Amidation site.

amino acids 414-418

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 290-301

ATP/GTP-binding site motif A (P-loop).

amino acids 546-554

CUB domain proteins profile.

amino acids 294-301 
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FIGURE 71 

AAGTCATTCAGTGGATGTGATCTTGGCTCACAGGGGACGATGTCAAGCTCTTCCTGGCTCCTTCTCAGCCTTGTT
GCTGTAACTGCTGCTCAGTCCACCATTGAGGAACAGGCCAAGACATTTTTGGACAAGTTTAACCACGAAGCCGAA 
GACCTGTTCTATCAAAGTTCACTTGCTTCTTGGAATTATAACACCAATATTACTGAAGAGAATGTCCAAAACATG 
AATAATGCTGGGGACAAATGGTCTGCCTTTTTAAAGGAACAGTCCACACTTGCCCAAATGTATCCACTACAAGAA 
ATTCAGAATCTCACAGTCAAGCTTCAGCTGCAGGCTCTTCAGCAAAATGGGTCTTCAGTGCTCTCAGAAGACAAG 
AGCAAACGGTTGAACACAATTCTAAATACAATGAGCACCATCTACAGTACTGGAAAAGTTTGTAACCCAGATAAT 
CCACAAGAATGCTTATTACTTGAACCAGGTTTGAATGAAATAATGGCAAACAGTTTAGACTACAATGAGAGGCTC 
TGGGCTTGGGAAAGCTGGAGATCTGAGGTCGGCAAGCAGCTGAGGCCATTATATGAAGAGTATGTGGTCTTGAAA 
AATGAGATGGCAAGAGCAAATCATTATGAGGACTATGGGGATTATTGGAGAGGAGACTATGAAGTAAATGGGGTA 
GATGGCTATGACTACAGCCGCGGCCAGTTGATTGAAGATGTGGAACATACCTTTGAAGAGATTAAACCATTATAT 
GAACATCTTCATGCCTATGTGAGGGCAAAGTTGATGAATGCCTATCCTTCCTATATCAGTCCAATTGGATGCCTC 
CCTGCTCATTTGCTTGGTGATATGTGGGGTAGATTTTGGACAAATCTGTACTCTTTGACAGTTCCCTTTGGACAG 
AAACCAAACATAGATGTTACTGATGCAATGGTGGACCAGGCCTGGGATGCACAGAGAATATTCAAGGAGGCCGAG
AAGTTCTTTGTATCTGTTGGTCTTCCTAATATGACTCAAGGATTCTGGGAAAATTCCATGCTAACGGACCCAGGA 
AATGTTCAGAAAGCAGTCTGCCATGCCACAGCTTGGGACCTGGGGAAGGGCGACTTCAGGATCCTTATGTGCACA 
AAGGTGACAATGGACGACTTCCTGACAGCTCATCATGAGATGGGGCATATCCAGTATGATATGGCATATGCTGCA 
CAACGTTTTCTGCTAAGAAATGGAGCTAATGAAGGATTCCATGAAGCTGTTGGGGAAATCATGTCACTTTCTGCA 
GCCACACCTAAGCATTTAAAATCCATTGGTCTTCTGTCACCCGATTTTCAAGAAGACAATGAAACAGAAATAAAC 
TTCCTGCTCAAACAAGCACTCACGATTGTTGGGACTCTGCCATTTACTTACATGTTAGAGAAGTGGAGGTGGATG 
GTCTTTAAAGGGGAAATTCCCAAAGACCAGTGGATGAAAAAGTGGTGGGAGATGAAGCGAGAGATAGTTGGGGTG 
GTGGAACCTGTGCCCCATGATGAAACATACTGTGACCCCGCATCTCTGTTCCATGTTTCTGATGATTACTCATTC 
ATTCGATATTACACAAGGACCCTTTACCAATTCCAGTTTCAAGAAGCACTTTGTCAAGCAGCTAAACATGAAGGC 
CCTCTGCACAAATGTGACATCTCAAACTCTACAGAAGCTGGACAGAAACTGTTGTAAGAAATACCTCAAAATGTT 
GAACCTCTCCTAGTATTCAGTATTACTCATTTCCATGCCTAGGTTTGTATTTGATTTCTTTGTTCTAAAAAGAAA 
ATTTTATGGCCTCAAAATGTCCTCATTTACAAACCAAACATTTAATTTGTGGTCAGACAGGAACCTAGACCATAC
AACAATTGGGTGGGCCACCTCTTTTCTCCCTATCATAACTACAGCCCTCTCTTCCTGGTAATTGGAAGGAAAGAG 
CGGTTTAGGGTGGAATATATCTGTTAATATGCATTCTTTTCTTATCTGCCAGAAGCAAATTTAGCCAAGTCAAAG 
AGAAGAAACCATAGATCATAGATGTAAATATATGTACATCTGGAACCCCTCAAAAGGCCCTGAACCCCCTTTTTT 
TGTGTAGCAATATGCTGAGGCTTGGAAAATCAGAACCCTGGACCCTAGCATTGGAAAATGTTGTAGGAGCAAGAA 
CATGAATGTAAGGCCACTGCTCAACTACTTTGAGCCCTTATTTACCTGGCTGAAAGACCAGAACAAGAATTCTTT 
TGTGGGATGGAGTACCGACTGGAGTGCATATGCAGACCCAAAGCATCAAAGTGAGGATAAGCCTAAAATCAGCTC 
TTGGAGATAAAGCATATGAATGGAACGACAATGAAATGTACCTGTTCCGATCATCTGTTGCATATGCTATGAGGC 
AGTACTTTTTAAAAGTAAAAAATCAGATGATTCTTTTTGGGGAGGAGGATGTGCGAGTGGCTAATTTGAAACCAA 
GAATCTCCTTTAATTTCTTTGTCACTGCACCTAAAAATGTGTCTGATATCATTCCTAGAACTGAAGTTGAAAAGG 
CCATCAGGATGTCCCGGAGCCGTATCAATGATGCTTTCCGTCTGAATGACAACAGCCTAGAGTTTCTGGGGATAC 
AGCCAACACTTGGACCTCCTAACCAGCCCCCTGTTTCCATATGGCTGATTGTTTTTGGAGTTGTGATGGGAGTGA 
TAGTGGTTGGCATTGTCATCCTGATCTTCACTGGGATCAGAGATCGGAAGAAGAAAAATAAAGCAAGAAGTGGAG 
AAAATCCTTATGCCTCCATCGATATTAGCAAAGGAGAAAATAATCCAGGATTCCAAAACACTGATGATGTTCAGA 
CCTCCTTTTAGAAAAATCTATGTTTTTCCTCTTGAGGTGATTTTGTTGTATGTAAATGTTAATTTCATGGTATAG 
AAAATATAAGATGATAAAGATATCATTAAATGTCAAAACTATGACTCTGTTCAGAAAAAAAATTGTCCAAAGACA 
ACATGGCCAAGGAGAGAGCATCTTCATTGACATTGCTTTCAGTATTTATTTCTGTCTCTGGATTTGACTTCTGTT 
CTGTTTCTTAATAAGGATTTTGTATTAGAGTATATTAGGGAAAGTGTGTATTTGGTCTCACAGGCTGTTCAGGGA 
TAATCTAAATGTAAATGTCTGTTGAATTTCTGAAGTTGAAAACAAGGATATATCATTGGAGCAAGTGTTGGATCT 
TGTATGGAATATGGATGGATCACTTGTAAGGACAGTGCCTGGGAACTGGTGTAGCTGCAAGGATTGAGAATGGCA 
TGCATTAGCTCACTTTCATTTAATCCATTGTCAAGGATGACATGCTTTCTTCACAGTAACTCAGTTCAAGTACTA 
TGGTGATTTGCCTACAGTGATGTTTGGAATCGATCATGCTTTCTTCAAGGTGACAGGTCTAAAGAGAGAAGAATC 
CAGGGAACAGGTAGAGGACATTGCTTTTTCACTTCCAAGGTGCTTGATCAACATCTCCCTGACAACACAAAACTA 
GAGCCAGGGGCCTCCGTGAACTCCCCAGAGCATGCCTGATAGAAACTCATTTCTACTGTTCTCTAACTGTGGAGT 
GAATGGAAATTCCAACTGTATGTTCACCCTCTGAAGTGGGTACCCAGTCTCTTAAATCTTTTGTATTTGCTCACA 
GTGTTTGAGCAGTGCTGAGCACAAAGCAGACACTCAATAAATGCTAGATTTACAAAA 
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FIGURE 72 

MSSSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMN 
NAGDKWSAFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNTMSTI
YSTGKVCNPDNPQECLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNE
MARANHYEDYGDYWRGDYEVNGVDGYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLMNAY 
PSYISPIGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFF 
VSVGLPNMTQGFWENSMLTDPGNVQKAVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGH 
IQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLK 
QALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLF 
HVSDDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLL 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-glycosylation sites. 

amino acids 53-57, 90-94, 103-107, 322-326, 432-436, 546-550

N-myristoylation sites.

amino acids 260-266, 286-292, 395-401 

Cell attachment sequence. 

amino acids 204-207 

Neutral zinc metallopeptidases, zinc-binding region signature.

amino acids 371-381 
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FIGURE 73 

CCCACGCGTCCGAGCGGGGTGGACAAGTGGCGTGTGTGCTGCGACCCCGAGGGAAGATGAACG 
GGACGCGGAACTGGTGTACCCTGGTGGACGTGCACCCAGAGGACCAGGCGGCGGCGGGCAGGA 
AGACCTATGCCATGGTGTCCAGCCACTCAGCTGGTCATTCTCTGGCTTCAGAACTGGTGGAGT 
CCCATGATGGACATGAGGAGATCATTAAGGTGTACTTGAAGGGGAGGTCTGGAGACAAGATGA 
TTCACGAGAAGAATATTAACCAGCTGAAGAGTGAGGTCCAGTACATCCAGGAGGCCAGGAACT 
GCCTACAGAAGCTCCGGGAGGATATAAGTAGCAAGCTTGACAGGAACCTAGGAGATTCTCTCC 
ATCGACAGGAGATACAGGTGGTGCTAGAAAAGCCAAATGGCTTTAGTCAGAGTCCCACAGCCC 
TGTACAGCAGCCCACCTGAGGTGGACACCTGTATAAATGAGGATGTTGAGAGCTTGAGGAAGA 
CGGTGCAGGACTTGCTGGCCAAGCTTCAGGAGGCCAAGCGGCAACACCAGTCAGACTGTGTGG 
CTTTTGAGGTCACACTCAGCCGGTACCAGAGGGAAGCAGAACAAAGTAATGTGGCCCTTCAGA 
GAGAGGAGGACAGATGTCCAGAGTGATTGGAGAATGTCCTGGGGGAATGAAGTTCCTTCCACA 
AACACAGCTCAGTTCTTAGCAACAAACTGTTTGTTTTTCTACTTGCTCCATCTGCAGCCTACG 
CTGCCCTGGCCTCCTGCAGACAGATAGTGGGGTTACCTGGCAAGGCCTGGTGAGAGCCAGTGA 
ACCTAAGCTTTGACTGGGTGGCCTTGTCTTTCTGGGGAGGAGGGAATGTACATTCAGGGAGTA
GCCTTTTGCGGAAAAATTCTCTAGGGCTACAGACAGTCATGTGTGACTTCTCTCTGCTGTGAA 
AACTCCCAGAGTCTCTTTAGGGATTTTCCCTAAGGTGTACCACCAGGCACACCTCAGTCTTCT 
TGACCCAGAGCCTGAAAACTGTTTTCACTGGGTTCCACCAGTCCCAGCAAAATCCTCTTTGTA 
TTTATTTTGCTAAGTTATTGGTGGTTTTGCTTACATCTCATGATTGATATAATACCAAAGTTC 
TATAGCCTTCTCTTGCAGTATTTGGATTTGCTTGAAACCGGGAAAACTGTTCCCATTAGGCTT 
GTTAATGTCAGAGTGACACTATTATGAATCTTTCTCTCCCTTTCCTCTGCCTGTTTCTTCTCT 
CTTTCTCCTTCAAACTTGCTCTGCAGCTAAGGAAGGTGAGTCTACTTTCCCTGAGGCTTTGGG 
GTCAGAGTATATGTTGTTTGGAGAAAGAGGGCAATCAGGACTCTTCTGGGACCCAGATGAGTT 
CTTCACTAGCCCTTCTGAACCCCTTGCTCCATAATTGGTCTTTTATCCTGGCTCTGAATGACC 
CTGCAGGTCATCATGGTTTTCTTTTTTTATTGTTTTTTTTTTTTTCTGAGACAGAGTCTCACT 
CTGTCACCCAGGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGCAACCTCTGCCTCCCGGAT 
TTAAGCGATTCTTCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGTGTGCCACCACGCCTG 
GCTGATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATACTGGCTAGGCTGGTCTCGAATT 
CCTGACCTCAGGTGATCCACCCACCTCGGCTTCCCAAAGTGCTAGGATTATAGGCTTGAGCTA 
CTGCGCCCGGCCCATGGTGTTTTTCTTTAGGGCTCTTCCTACAGCCTTGAGAAGTAGATAGGC 
ATCAGAGTATGGTACTATAGGAATCAGAAAAATTCAAAACAAATGTGGATTAAGTGTTTAGGC 
TCTATGTGGCTCACGCAGCCAGAATCCTTAAGTCTGTGTGTTTCTGTGTCTCAAGACTGGGCT 
CACATTCTGGCTTTGTCCATAACAATGCTCTGGGATTTCAGGGAGTTCCCTCATTTGTAAAAT 
GAGGGGGTCAGAGCAGGTGATATCCATGTTTCTTCCCTTTCTGATATTGTTGTCTGTGGCATA 
TTCTTTGTATGGCGAATTTAATAAATTATATTAATGTGTCA 
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FIGURE 74 

MNGTRNWCTLVDVHPEDQAAAGRKTYAMVSSHSAGHSLASELVESHDGHEEIIKVYLKGRSGD 
KMIHEKNINQLKSEVQYIQEARNCLQKLREDISSKLDRNLGDSLHRQEIQVVLEKPNGFSQSP 
TALYSSPPEVDTCINEDVESLRKTVQDLLAKLQEAKRQHQSDCVAFEVTLSRYQREAEQSNVA
LQREEDRCPE

Important features of the protein: 
Signal peptide: 

amino acids 1-39

N-glycosylation site. 

amino acids 2-6 

Amidation site. 

amino acids 21-25 
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FIGURE 75 

GCTTGCACACATGGCTCCGGAGGCTCCGGTTGCCCATCCGAGCCCCTGCCAGGCTCTAACGTTCCCAACTGACAA 
CACCAGTAACTAAATATAGGAGCAGATGGTGGGGACGGGCTGTCGCAGCGGCTCCTTTGCAGAGGTCTCCGGACT 
GCAGATAAGGCTCAGGCCCTTTTGTGAGAAGCAGACCAGCCTGGGGGCTGGCGGCAGGACACCTGTGTCTGCATG 
CTGAAGAAGATGGGTGAGGCCGTGGCCAGAGTAGCAAGGAAGGTCAACGAGACGGTGGAGAGCGGCTCTGACACT 
CTGGACCTGGCCGAGTGCAAGCTGGTCTCCTTTCCCATTGGCATCTACAAGGTCCTGCGGAATGTCTCTGGCCAG 
ATCCACCTCATCACCCTGGCTAACAACGAGCTTAAGTCCCTCACCAGCAAGTTCATGACCACATTCAGTCAGCTC 
CGAGAGCTCCACCTGGAGGGGAACTTCCTACACCGCCTCCCCAGCGAGGTCAGTGCCCTGCAGCACCTCAAGGCC 
ATTGACCTGTCCCGGAACCAGTTCCAGGACTTCCCTGAGCAGCTTACCGCCCTGCCGGCGCTGGAGACCATCAAC 
CTGGAGGAGAACGAGATCGTAGATGTGCCCGTGGAGAAGCTGGCCGCCATGCCAGCCTTGCGCAGCATCAACCTC 
CGCTTCAACCCACTCAACGCCGAGGTGCGCGTGATCGCCCCGCCGCTCATCAAGTTTGACATGCTCATGTCTCCG 
GAAGGCGCAAGAGCCCCCCTACCTTAGGCCACCCTCCTCATGCCCACCCAGCAAGGGACAGAGGCCACAGGCCTG 
GAACCCTGGAAGGGAGGGAGGCCCATGGGAGGCCAAGCCTGGGGGCTGGGGGCGGGTGGGCCGAGCAGCACGTGG 
TGGGTGGGGTGCAGCTGGTCTGGATAGATAGCTTACAGCAGTAGTGGGCTCTGGAATGCCCAAGGGAAGAGGCAA 
GGTGGGGCCTGCAGCCTGGACTCGGCACTCACAGCTGCTGTGCAAACTCAGGCAGATCTCCTGCCCTCTCTGAGC 
CTTGTCACTTGAAAAAAACAGGACCCTTTCCCTCCTTTGGGCTCCCTGGAGGTTTTTAAGCAGTACGTGCCTCCA 
AGTTACCTCCAGATCAGCAGGCACAGGTGGGCATTGCCAGGTATTTTCTGAGCCCCTGCGGGTTTGAGGCCTTGT 
TTTTAGTGCTGAGAGCCAGTTGCTGCCCTGAGAAGAGAAGACAACCTCCATCTATTTATTGCTTCCTGAGAACTG 
ACCTGGATGCGGCCCTCTGCAGGGCCCAGTCTTCAGTCCTGTGGTCCCTGGACTGGTGGGAACCTGAACTAGGAG 
TCCTGGGAGAGCTGTGGTGGGAATATGGGCTGGCACTGCTGCAGGGCAAGAACATTCATGTAGGAGCCCGAGGAC 
CANCANGCTGGGAATGGGGAGCAAGTCACGTCAGCTCTGTCATTCCCCACAGTTAACAAATTGGCGGGGTGGGAA 
GTCCTGAGTGCTCCGTCCCTCTAGCATCACTCCTGAGCTGCGGGAGAGGTGGCCCAGAGAACAGCAGAGTCAGTT 
ACACCTGCAGCTCTTGTCTAAAGTGATTAGATGGCCACCCTCACCACTGTCCAGTCCAGCAGCAGCCTGGCTGCC 
TTGTCATGGCCTCCTGGGGGCAGAAGGCGATGTGGACCACGGGATTTGTAGCCAGCCAGCTCCCAGGCCAACGCC 
CAAAGCCCTGATGACCTGGTTCTTCTGAGGCCCTCAACCTGGCATCTTAGGGTATGGTCAGGCAACAGGGTGACC 
AGCTGTCCTGGTTTCCCAGGACATGGAACTTTCAATGCTAAAACTGGGACATTACCCAGCAAGTGGGGATGGTTG 
GTCCCCTACCAGGAGAGGGCCTGGGGCTCTTGCTTCCCGAGAACGCCTGTGGCTTGAAGAACCTTGACTGCTTGG 
TCCTCAGGTATCTACCTCCCACCTTCTCCTCATCTGTGGAGCAAGCCAACTCAGTGCCCCAGACCCCACCTGATC 
TGCATCTTTGTTTGCTCCAGAGACACCTGAGGCCCCAGAGCTTGAGGCAAAGCCAGGCCGTCCAAATCCTGTGTG 
CCGTGGACGAGTGGCCACTTTACTACTCCTAAGGCTAAGATGTTGAGAGCTCAGACCACTGCTCAGAGCAGTAAT 
CCCTGCTCAGAATGCTCCCAGTTCCCTCGTCCCTGCCCAGGTCTCTTGTCTCTTGGGAAGGAACTGATAGGTCGG 
GCCATTGTTGGGCCATCACTGAGCGCTCAGTATCTCAAGAGACTCTGTTCATTCTGCTCGTATCCCAAGGCCTGG 
TTGGTCAAACTCTGGGCAAAGGGTTTTCAGGATGAGGAGGTCAAGACAGGATGTCCAGAGCTACCGAGTTCATCT 
GTGGGTGTTGGGGGCAAGTGGGGGCTGAAGTCCTGTGCAGGCTGCGCTGGCCCCACCTGCCTTGTGCCCTGGAGT 
GGGGTTTCTCCTTGTTGAAGAAGAGGCATCCTTCTCTGATGTGCACAAACACAATGTATGACCAGAGCCTTGCAA 
CTCAAAGTGTGGTCTGTGGACCAGCAGCGGCAGTGACACCTGGGAGCTTGTTAGGAATGCAGAGTCTAGGCCTCA 
CCCTATACCTCCCGACTCAGACCCTGCATTTTAGCAAGACCCCCAGCTGATTCCTATAAGCACTTTAGAGTTTGA 
GAAGCAAGGACCTAGGCTGGGGATGTCCTCCGAGCAGAGGGTGAAGTTTCTCTCAGTTCTCTCCCTGCCACTTCC 
AGGGATCTGAGCCTGTGTTCAGCCTCCTCCCTAACCCACCCTGGGAGACACTTGGCCTGTTAGATTGTTCCAGAG 
TCTGCATGGCACTCCTGAAGAAGGGAGTGTGACCTGCAGTCACCAGGAGATGAGGGTTAGGTGTGCCCAGCCCTC 
CAGACCCGGCCTTTCTGGTTAACCCCTGCATGCCAAGCTGCCTGCTGCCCCAGGTCCTCACCTCAGGCCTTTGAA 
GGGGCAGCTTCTGGAAGTTGTTTTCTCCTCTGCTTGGAGAGTTTGCCCTTGTCTGTCTTGGAAAGTGTGGGCAGC 
CACAGATGCCCCCAAATCAGAGCTCACAGTGAGTGAGCCCCTAAGCTTCAGTCTGCAATAAAGAATGCATTGGTT 
TCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 76 

MLKKMGEAVARVARKVNETVESGSDTLDLAECKLVSFPIGIYKVLRNVSGQIHLITLANNELK 
SLTSKFMTTFSQLRELHLEGNFLHRLPSEVSALQHLKAIDLSRNQFQDFPEQLTALPALETIN 
LEENEIVDVPVEKLAAMPALRSINLRFNPLNAEVRVIAPPLIKFDMLMSPEGARAPLP

Important features of the protein: 
N-glycosylation sites.

amino acids 17-21, 47-51 
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FIGURE 77 

CACCAACAAGCAATCGTTCATGAGAAAGCCGTGCACCCGCTGCAGTTGGGCCATGTGGTCCGCATCGTATTCCAC 
TAGGTCCCCATTGTACACCAAGTACTGTCCCGGCGTCTCCAGCAGATGCCTGCAGCCTTCCACCTTCTCAAGCAG 
GGTGGTGTGAGTGCGCTGCTTTCCTTCCTCGCCTGGACCGGAGCCGTCGCGGGAGGCACCCCCGGGGGTGGAGAA 
AAAGCCGGCCTGGCCTCGGAGGTGGTCTCGGCCCCCCGCCCCACCGACTCCCTCCTCCCCTCCAGAGGCGGCGGC 
GGCTCCGGCGGCAGCAGCGGCAGGCAGCAACGTAAGCGGGATGCTCTCCAGGCTGCTTTTCTGCTCGGTCAGCAA 
ATGGCTGAGCTGGTACATCTCGCTCTCCAGGTAGGAGATCTCGCGGGCCGTCTCTATGAACTGCCGGTAGTTCTG 
GTAGACGTTGCGCTTCAGGTTCTGCGCCGTCTCCTCCGCCAGCGCCTGGATGCGCTGCCGGTGCTCCTGGAGGTC 
CCGGTCCCCATCCGACTGCTGCGAGAGCTGCTTCACGTACAGCCGCGCCTCAAAACCCCCTGACTCCAGCTGCCG 
ACGCAGGCGGCTCGCCCCACTGTCCGACATCGCCATCGCCATTTCTCTCCGGGTCTCACGCACTCACTGTCACTA 
TCGGCGCCGCAGCCGCCGCGGCTGTCTAGACCCACCCAAGGCCAACCGAGCTCCTGGGCTGAGGAAGCAGGAATG 
GGAACGAGACGAGTACGCCTGCGCCGGGTCTGAGCGTCAGACACTGCGCCTGCGCAAGTGGGCCGAGCGCAGACA 
TTGCGCCTGCGCAGCAATGCCATCGGTTAAAGCGCATGCGCAAGATGAGCTATTGCGGAAGTGAGGGGAGGGAGA 
GGCCGAGAGAAATTTCGGTACTGCGCATGAACCGAGCGTGACGTTGAGGTTTGAAATAACCGGCAAAGAGTAAAG 
GCTGAAACTAGCTTCCTGAAAGCTTCGTAGGGCCCGAGCCCTGTGAGCCCAGGTTCTGCGCCCACTAGGAGGTGT 
CATGCTGACTGCTTTTTTTAAAGCCCTAGAATCCTTGGCTTCGGCGTTTGGGGTAAGCTCCGTTCTCGTTCTCAA 
GCGGGTTTCCGCGAACTCTCGCGGGATTGACGGGCCGTCTCGAGAGCCGGCATCTCCTAGGAGCTAGTCCTGGTC 
CTCGGCTAGGCGGCTTGGGGTCGCGGCGTAACTGGGGAGCCAGCCTGACGCCGGCGGACCCCGCCTGTGATCCTG 
GCAACGATGGATGATGACTTGATGTTGGCACTGCGGCTTCAGGAGGAGTGGAACTTGCAGGAGGCGGAGCGCGAT 
CATGCCCAGGAGTCCCTGTCGCTAGTGGACGCGTCGTGGGAGTTGGTGGACCCCACACCGGACTTGCAGGCACTG 
TTTGTTCAGTTTAACGACCAATTCTTCTGGGGCCAGCTGGAGGCCGTCGAGGTGAAGTGGAGCGTGCGAATGACC 
CTGTGTGCTGGGATATGCAGCTATGAAGGGAAGGGTGGAATGTGTTCCATCCGTCTCAGCGAACCCCTTTTGAAG 
TTGAGGCCAAGAAAGGATCTTGTAGAGACCCTCCTGCATGAAATGATACATGCCTATTTATTTGTCACTAATAAC 
GACAAAGACCGAGAAGGGCATGGTCCAGAATTTTGTAAACATATGCATCGCATCAACAGCCTGACTGGAGCCAAT 
ATAACGGTATACCATACTTTTCACGATGAGGTGGATGAGTATCGGCGACACTGGTGGCGCTGCAATGGGCCGTGC 
CAGCACAGGCCACCGTATTACGGCTATGTCAAACGAGCTACTAACAGGGAACCCTCTGCTCATGACTATTGGTGG 
GCTGAGCACCAGAAAACCTGTGGAGGCACTTACATAAAAATCAAGGAACCAGAGAATTACTCAAAAAAAGGCAAA 
GGAAAGGCAAAACTAGGAAAGGAACCAGTATTGGCCGCAGAGAATAAAGGTACCTTCGTGTATATTCTTCTGATT 
TTTATGTGACCATAGCTATGATGTAAAGACAATACTGTCCTTCAGAGAACTGGTATTAAGATAAACTTAAGGATC
GTTTCTGGTGTAGAAGTCTTCAAGTGTAGACTTAAGGAAAAAATCCCACTGTCCATGAAATGATGGTAGGAAAAC 
AGACTTTGCTCTGTACAGAAGTAAGTAAAAGTAGGAATAGTTTCCATGGATATTTTTATTTTTATTAACTTTTTT 
CAGTTTCTTTTTATTCAAAGAAACAAAATTCAATCTCTGATAATATTTGAGGTAAAGTTCCTTTCCCTATCTTGA 
CTCACTGAGTTATTAGGAAACAGAAGGCAAAAAGATTGTCAAAATAAAAACAATAATTCAAGTAACAATGCCCGG
AATATACGTCCTAACTACACCCCTTCCTATCAGCTGGATTCTATCCAAGTGACTCTATTGATGTATGTATGTTCA 
TTCAAAGAATGGGAAAAGGATATGACATATATTTGCCAGTACTTCATCTTCAAGATTTACCCTTTTCCTGTGAAG 
TTCAGAGTTACTGAAGATGCTTCTTCCCTTGGGAAGTTGTTGACCCAAGAACATAGGTTATATTTCCCAAATCTT 
TAATTATTGAGTGAAAGAGCTATAGATGAATTGATATGGAAAGACCGTATCTTCATTTTCGTGAGTAGAAGGAAA 
GATAAGAATGAGGCAGCAGATTTTCCCTCCTGGAATTACACATAAAGGACACTAAGCAATTTTCAAGGTAAATGT 
TGCCTTGTTGTTGGTCTTTGGCATGATAAGATTCTTTATTTAAATATGAGAGAATTTTTTTTTATCCTTTATATT 
CTCTCAATATCAGAACTCCTGAATTCTGAAGATTGCCCTCCTCCCATTAATAGGATTGTATGGATGTAAGATGGA 
ATAAAATACTAGTTCTTCATTTTGAGAAAACTGTACATTAGTTTAATGTTTGTTACTGTATTTCTTTTGAGTTGA 
GGCACTTACATAACAATCTTCTTTGCTTTTTTGGCAGATAAACCCAACAGAGGTGAGGCCCAGCTAGTAATCCCT 
TTTAGTGGGAAAGGATATGTTCTAGGAGAAACAAGCAATTTACCTTCACCTGGGAAACTGATCACTTCACATGCC 
ATTAATAAAACCCAAGATCTTTTAAATCAAAACCATTCAGCAAATGCTGTAAGACCTAATTCTAAAATCAAGGTG 
AAATTTGAACAGAATGGTTCAAGTAAAAATTCTCATCTGGTCTCCCCTGCTGTTAGTAACAGTCACCAAAATGTT 
CTAAGCAACTACTTTCCTAGAGTATCATTTGCCAACCAAAAGGCTTTCAGAGGTGTGAATGGATCTCCAAGGATA 
AGTGTAACAGTTGGCAACATCCCTAAAAACTCAGTCTCTTCTAGTTCTCAGAGAAGGGTTTCATCTTCTAAGATA 
TCCCTAAGAAATTCTTCAAAAGTAACGGAATCAGCATCTGTGATGCCATCCCAGGATGTGAGTGGGTCTGAAGAT 
ACATTCCCAAATAAACGACCTAGGCTAGAAGATAAAAAAAAA 
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FIGURE 78 

MDDDLMLALRLQEEWNLQEAERDHAQESLSLVDASWELVDPTPDLQALFVQFNDQFFWGQLEA
VEVKWSVRMTLCAGICSYEGKGGMCSIRLSEPLLKLRPRKDLVETLLHEMIHAYLFVTNNDKD 
REGHGPEFCKHMHRINSLTGANITVYHTFHDEVDEYRRHWWRCNGPCQHRPPYYGYVKRATNR 
EPSAHDYWWAEHQKTCGGTYIKIKEPENYSKKGKGKAKLGKEPVLAAENKGTFVYILLIFM 

Important features of the protein: 
Signal peptide: 

amino acids 1-41 

N-glycosylation sites.

amino acids 148-151, 217-220 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 184-187 

Casein kinase II phosphorylation sites. 

amino acids 30-33, 121-124, 154-157, 187-190, 192-195

Tyrosine kinase phosphorylation site. 

amino acids 211-218 

N-myristoylation sites. 

amino acids 59-64, 85-90, 146-151 

Neutral zinc metallopeptidases, zinc-binding region signature.

amino acids 108-117 
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FIGURE 79 

CGGACGCGTGGGTGGCAACCAGGAGAAGCCAAACTTGGTCCCCCGGCTCGCGGAGTGCCTGCG 
AGCGGTGCTCATGGCGCTCTATGAGGTCTTCTCTCACCCGGTCGAGCGCAGTTACCGCGCGGG 
GCTCTGCTCCAAAGCCGCGCTGTTCCTGCTGCTGGCCGCTGCGCTCACGTACATCCCGCCGCT 
GCTGGTGGCCTTCCGGAGCCACGGGTTTTGGCTGAAGCGGAGCAGCTACGAGGAGCAGCCGAC 
CGTGCGCTTCCAACACCAGGTGCTGCTCGTGGCCCTGCTCGGACCCGAAAGCGACGGGTTCCT 
CGCCTGGAGCACGTTCCCCGCCTTCAACCGGCTGCAAGGGGATCGCCTGCGCGTCCCGCTCGT 
TTCGACTAGAGAAGAAGACAGGAACCAGGATGGGAAGACGGACATGTTACATTTTAAGCTGGA 
GCTTCCCCTGCAGTCCACGGAGCACGTTCTCGGTGTGCAGCTCATCCTGACTTTCTCCTATCG 
ATTACACAGGATGGCGACCCTCGTGATGCAGAGCATGGCGTTTCTCCAGTCCTCCTTTCCTGT 
CCCGGGATCCCAGTTATACGTGAACGGAGACCTGAGGCTGCAGCAGAAGCAGCCGCTGAGCTG 
TGGTGGCCTAGATGCCCGATACAACATATCCGTGATCAACGGGACCAGCCCCTTTGCCTATGA 
CTACGACCTCACCCATATTGTTGCTGCCTACCAGGAGAGGAACGTTACCACCGTCCTGAATGA
TCCCAACCCCATCTGGCTGGTGGGCAGGGCCGCAGATGCTCCATTTGTGATTAATGCTATCAT 
CCGATACCCTGTGGAAGTCATTTCTTATCAGCCAGGATTCTGGGAGATGGTAAAGTTCGCCTG 
GGTACAGTATGTCAGCATCCTGCTTATCTTCCTCTGGGTGTTTGAAAGAATCAAGATCTTCGT 
GTTTCAGAATCAGGTGGTGACCACCATTCCTGTGACAGTGACGCCCCGGGGAGACTTGTGTAA 
GGAGCACTTATCCTAGAAAGGCCATTTCTGAAGACTCAGCAGGACCGTGGCTGCCTCATTGTC 
ATCTTCTGGGAACATCTTAGGACCTTTTGAAAGAGCCCAGCGGACACCTGCGGGCTTGTGTGC 
TTTTCCCTCAGAGACAACGGTTCTTTCCGGTTTTGCTCTACACAGTTCCGTATCTTCAGAGCT 
CCTGCAGAATTGTCAGGGACTAGTTTGTGGAAAGGTCTGAGAGTTCCTGGAGGCTATAATTAG 
CTTTTTGGGTTTTCCTTCTTTGCCTTAGCGTTGAATTTCAGGAGAAAATTGCAGTCAGTTCAG 
ACATCTTGGAAAGAGTCCCATCTCTGGTCAAGCAGAGACTTTTCCTCTGTTGAACTGAGGAAC 
ACACTGTGCATTTCTTCCTTCTGTTGTGAGCCACTCTTACTCTTTTCAGGGCTCTCTTGTGAC 
AAACATGCCAATCACTAGCACTTTGCACCCCTGGGCTTCTCCATTTCCCATTCACAGCTTTGA 
TTTCCAGAGCTGAGGCCTTTAACTGGAGACCTGGAGGGGCAGGGCCCAAGGGCAAGGGCCGCA 
TTAGCACAGGCAATCAGGGAGGGCCGCTGAAGGACACTTGGACCGTCCACCTGCCCCAGCCCA 
ACAGTCAGTCATCTGTCATCAGCTCAGCTGAGCAGCCCTGGATCTTTGCCGTACTGTGACTGG 
GCTCTTTGCCCTATTTTTCCCTCTGTCTGTGCCCCTGGATGGCAGGCTGAAGTCAGAGGGGCT 
GTTTCATTCTCAGCCCCCTCAGCAGCACTGGGGGAAGAAAGCATTGTCACAACAGGTTCTTTC
TGGCCCTCACCCAACAGCCTGGGCACTTGGCCCTCCTCCTCCTTGACAGCCCTCCCCCTTCCT 
GCAAAGGACAGGGGCGACAGGGGTTGGTGTTGGGATTGGCTCCCGCTGCCTGACAACCACAAG 
TTTATTTGGAAGGCTAGCGGGAAGCCCAGCGGCTGGCGTTTCCCTTGACTAAGGAACAGGGTG 
CCCATCAGAGTGGGGCGGGCAGCTTTGGGAAGGACACAAGAAGCAGTAAGAGTGTAAAGAGGA 
TGCTGGCCTGGGCAGGCCAGTCCAGCCTGGCCACTAGCAGAATACCAAGCAGTCCAGTGGATT 
ACCCTCGTGGCTAAGCAAGTGTCTGCAGGAGCAGAGATGGCTGGAAGGGGCCTCTGCACACGG 
AAGATGGCTTGTTCAGCCCATTCACCTCCTGAGGATGTGGGCAGTCTCCTCCAAGAACACATG 
GAGCTGCTTCCTGATCCCAAGCAGGTCATTGCCACTGGAAGGACATGGCCCCGGTGATCCATG 
CTTCATGCCCACCCAGAAACACACCCCTCAGTGTGTGCCTCAGTTTACTTTGGAGATCAGTTG 
TCGTTTTTAGTGCTCCTTTAGGCTTACTAAAACAGTTTTGGAAACTAAGCTATTTTGAAGTAT
TCAAGCAGAGGAATTCCCTAACACTGACCCCCTTGTCTTTTTTTAATATTCAGGCTGTTTTAT 
ATGCCTAAATTTTTTTCTTAAGATCTAAACGAAAAATAGTTTCTTGTTTAAATTCACATAAGG
CAATGAGATATGGAAAGATGACAAGATACGTATAAACATTGGTTTGCATCTTATTAAATTATT 
CTAATGCAAATCTTGTATAAAGAACCCATGATGTTTTGTAACTTTCTAATTAAAATGTTCAAA 
ATGAG
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FIGURE 80 

MALYEVFSHPVERSYRAGLCSKAALFLLLAAALTYIPPLLVAFRSHGFWLKRSSYEEQPTVRF 
QHQVLLVALLGPESDGFLAWSTFPAFNRLQGDRLRVPLVSTREEDRNQDGKTDMLHFKLELPL 
QSTEHVLGVQLILTFSYRLHRMATLVMQSMAFLQSSFPVPGSQLYVNGDLRLQQKQPLSCGGL 
DARYNISVINGTSPFAYDYDLTHIVAAYQERNVTTVLNDPNPIWLVGRAADAPFVINAIIRYP 
VEVISYQPGFWEMVKFAWVQYVSILLIFLWVFERIKIFVFQNQVVTTIPVTVTPRGDLCKEHLS

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 268-284 

N-glycosylation sites.

amino acids 194-198, 199-203, 221-225 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 51-55 

Tyrosine kinase phosphorylation site. 

amino acids 250-259 

N-myristoylation site. 

amino acids 187-193 

Cell attachment sequence. 

amino acids 307-310 
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FIGURE 81 

GCCGGGAGCTTCCCTGATGGTGCCGCCGCCTCCGAGCCGGGGAGGAGCTGCCAGGGGCCAGCTGGGCAGGAGCCT 
GGGTCCGCTGCTGCTGCTCCTGGCGTTGGGACACACGTGGACCTACAGAGAGGAGCCGGAGGACGGCGACAGAGA 
AATCTGCTCAGAGAGCAAAATCGCGACGACTAAATACCCGTGTCTGAAGTCTTCAGGCGAGCTCACCACATGCTA 
CAGGAAAAAGTGCTGCAAAGGATATAAATTTGTTCTTGGACAATGCATCCCAGAAGATTACGACGTTTGTGCCGA 
GGCTCCCTGTGAACAGCAGTGCACGGACAACTTTGGCCGAGTGCTGTGTACTTGTTATCCGGGATACCGATATGA 
CCGGGAGAGACACCGGAAGCGGGAGAAGCCATACTGTCTGGATATTGATGAGTGTGCCAGCAGCAATGGGACGCT 
GTGTGCCCACATCTGCATCAATACCTTGGGCAGCTACCGCTGCGAGTGCCGGGAAGGCTACATCCGGGAAGATGA 
TGGGAAGACATGTACCAGGGGAGACAAATATCCCAATGACACTGGCCATGAGAAGTCTGAGAACATGGTGAAAGC 
CGGAACTTGCTGTGCCACATGCAAGGAGTTCTACCAGATGAAGCAGACCGTGCTGCAGCTGAAGCAAAAGATTGC 
TCTGCTCCCCAACAATGCAGCTGACCTGGGCAAGTATATCACTGGTGACAAGGTGCTGGCCTCAAACACCTACCT 
TCCAGGACCTCCTGGCCTGCCTGGGGGCCAGGGCCCTCCCGGCTCACCAGGACCAAAGGGAAGCCCAGGCTTCCC 
CGGTATGCCAGGCCCTCCTGGGCAGCCCGGCCCACGGGGCTCAATGGGACCCATGGGACCATCTCCTGATCTGTC 
CCACATTAAGCAAGGCCGGAGGGGCCCTGTGGGTCCACCAGGGGCACCAGGAAGAGATGGTTCTAAGGGGGAGAG 
AGGAGCGCCTGGGCCCAGAGGGTCTCCAGGACCCCCTGGTTCTTTCGACTTCCTGCTACTTATGCTGGCTGACAT 
CCGCAATGACATCACTGAGCTGCAGGAAAAGGTGTTCGGGCACCGGACTCACTCTTCAGCAGAGGAGTTCCCTTT 
ACCTCAGGAATTTCCCAGCTACCCAGAAGCCATGGACCTGGGCTCTGGAGATGACCATCCAAGAAGAACTGAGAC 
AAGAGACTTGAGAGCCCCCAGAGACTTCTACCCATAGCACATCCCAACACCGTCACGCCAAAGGAAGAGAAAGAT 
CAACTCACCTGCAGTTAAACCATCTAAAGAGAAGAAAGACCACTGGAGACCTAGAAAACATACATTTTTCTCTTC 
TCTTCTCCTGACGTCTCTCCACTCCTCTTCTTCCAAATACGATGCTATTTTCAGAGTCCCCTCCTAGGCCTGCAG 
ACATGAGGGAGTGAATGATTGATTTACCTGCTTCTCACTAAGAGTCCATTGGGGTGGTTTGCATTGTAACTTTTC 
TTTTACATCCTATTTTTCCAGGAACTTTGGATTTAAGTACTCTCACAGTGTCTTAAATCATAAATTCTTGAAGTT 
AAATTTGGCAGAGTATCAAAAGGGGGAAAATGACAAAGTGAGCTCTAAGAAAATGTGAGGCTACTTCTAAGATGT 
GTGTTCACAATAGACCATAACTCCTCTAGTATCAAAATTGGGGCTCTTCAGTTAAAAAGGGGTGGGGAGGACAAA 
CGTGTCGATGTGCTTTGGTGGAGAATTTTTTCCTTGTGCTTCTAGTAGACTTTAAATATTGTATCCCTTTGTCAA 
ACCTTGTTTCCCAAATTCAATTAAAGAGAGGAGAGAATTGAATGGCGTTTAGAGAAGATAGAAAAGAATCACAGT 
CATATATTTACTGTTATATAGATTGCCACATTCTAAAATTCAAATACGGTGCTTAAGGTTTCATGCCATGCTTAT 
CTGTAAGTATCCTATTTAGGGAAGAAGATTAAACTCTCTTTTCAAAAAAACAAAGTGAAATGCCTGGATTCACAT 
TAAAACAATGGGCTCTCGTTTGCTATAATATTTTAAAGCTGTTTAATCAACAGTGGAGTCTGCTCTATAAATATA 
GATTATTTGTTCAATAAACTGGCTGAGCTTAGAGAGAGGTGCAGAATTCCTGGTTCTGAGCAGGTGCCCAGAAGG 
TACCATTAGGTGCCATGATCCAGGCTGAACCAATATACAGTGGGGCTGAAGTCTGCAAGGAGGTTGCTGGCTTGG 
GCTGACCTCACTAATGCCATCAGCAGCGGTAGGTAAATTTTTTCTCCTTGGGTATTACAAGTTTTTGTCTGGAGC 
CAACCAAGCTTGCCACCAACATATTGAGAGTAATACACTATTGAAAGTTATCTTGGATGGGGAGAAAAAAAAATA
GTGGTTTTCCTTGTTTGCAAAAACTTCCTTCCTATTCTCATTTTTTCTTAATTTTCTTTAATTTAGTCCAAGTTC 
CAGTTCTTTTAGGCCTTCTCTTTGATTTATTTTCCCCTGCATGTGAGAAGCAGTTCAGAAAAAGGTCTATATCTC 
CACCTCCTAGTGAGTTAGAGTGTTTTCTCAGAGCACCTCTGGGTGGCAAAGGGAAGCATGTTCCTGCCAAGGTTT 
GCTGTGGATTCAGAAGCACCAGGAGCAAGAGACCAGAAGGATGATCTGCTCCTTTGTAACGTTGTTGAGGGCCCT 
CTTGTTTCCAATGAGCAGCTTATAGGTTACTCACAGTCCACTTTCTCACTGGACACACAAAGTGGCTCTTTATCT 
ACCTTTGCGGGAGATTTTCACTCTCCTGCAAATGATCGTTCTCACACTCATATTAGCTCATGTTGGAATTTCCCA 
TCCTGCCATGTCCTTTCCCATTTCTTTTTGGCTTTTTTGCCTCCACCTTTTAGCCCACATCATTTAACTCCACTA 
CTGTGAAAGCTTGCTTAAAGAAAATCCCTCTTGGCCGGGTGTGGTAGCCCACGCCTCTAATCCCAGCACTTTGGG 
AGGCTGAGGCGGGGAGATCACAAGGTCAGGAGATCGAGACCAGCCTGACCAACATGGTGAAACCCTGTCTCTACT 
AAAAATACAAAAATTAGCTGGGCGTGTTGGCACACACCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAA 
TTACTTTAACCTGCGGGGGGAGCCTAGATTGCGCTACTGCACTCCAGCCTAGGCAACAGAGGGAGACTCTGTCTC 
ATTAAAAA 
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FIGURE 82 

MVPPPPSRGGAARGQLGRSLGPLLLLLALGHTWTYREEPEDGDREICSESKIATTKYPCLKSS 
GELTTCYRKKCCKGYKFVLGQCIPEDYDVCAEAPCEQQCTDNFGRVLCTCYPGYRYDRERHRK 
REKPYCLDIDECASSNGTLCAHICINTLGSYRCECREGYIREDDGKTCTRGDKYPNDTGHEKS 
ENMVKAGTCCATCKEFYQMKQTVLQLKQKIALLPNNAADLGKYITGDKVLASNTYLPGPPGLP 
GGQGPPGSPGPKGSPGFPGMPGPPGQPGPRGSMGPMGPSPDLSHIKQGRRGPVGPPGAPGRDG 
SKGERGAPGPRGSPGPPGSFDFLLLMLADIRNDITELQEKVFGHRTHSSAEEFPLPQEFPSYP 
EAMDLGSGDDHPRRTETRDLRAPRDFYP 

Important features of the protein: 
Signal peptide: 

amino acids 1-34

N-glycosylation sites.

amino acids 142-148, 182-188

Tyrosine kinase phosphorylation site.

amino acids 125-132

N-myristoylation sites.

amino acids 10-16, 143-149, 155-161, 196-202, 250-256

Amidation site.

amino acids 299-303

Aspartic acid and asparagine hydroxylation site.

amino acids 150-162

Cell attachment sequence.

amino acids 176-179

C1q domain proteins.

amino acids 247-280 

Calcium-binding EGF-like domain proteins pattern proteins. 

amino acids 144-165 
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FIGURE 83 

ATCTGAGTGAGCTAACTGACACAATGAAACTGTCAGGCATGTTTCTGCTCCTCTCTCTGGCTC 
TTTTCTGCTTTTTAACAGGTGTCTTCAGTCAGGGAGGACAGGTTGACTGTGGTGAGTTCCAGG 
ACCGCAAGGTCTACTGCACTCGGGAATCTAACCCACACTGTGGCTCTGATGGCCAGACATATG 
GCAATAAATGTGCCTTCTGTAAGGCCATAGTGAAAAGTGGTGGAAAGATTAGCCTAAAGCATC 
CTGGAAAATGCTGAGTTAAAGCCAATGTTTCTTGGTGACTTGCCAGCTTTTGCAGCCTTCTTT 
TCTCACTTCTGCTTATACTTTTGCTGGTGGATTCCTTTAATTCATAAAGACATACCTACTCTG 
CCTGGGTCTTGAGGAGTTCAATGTATGTCTATTTCTCTTGATTCACTTGTCAATAAAGTACATTC 
TGCAAAAGCAAAAA 
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FIGURE 84 

MKLSGMFLLLSLALFCFLTGVFSQGGQVDCGEFQDPKVYCTRESNPHCGSDGQTYGNKCAFCK 
AIVKSGGKISLKHPGKC 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

N-myristoylation sites. 

amino acids 26-32, 52-58, 56-62, 69-75 

Kazal serine protease inhibitors family signature. 

amino acids 40-63
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FIGURE 85 

GGAGCAGACACACAGACCCGGGCCGGAGGCCCCTCTTCTAGCCCTGCGGGAACCGGACAGTTC 
CCCAACTGGGGACTCTGGAACCACAGCTCCTAAATCATCAAATTCTCAAGCTTTTTTTTTCCC
TCTCTTCGTCCCAGCCATCCCAGTCTTCTTCTTCTTTTTTTTTTTTTTAACTTATTGTTTTTT 
TCGCTCCTGTCATTATGAAAGTGGTCACGCCATTCAATATTAAGACTTGGAGGGAATTGGGGA 
AAGAAAAGAAAGAATCTAAAAGAAGAGAAGCGACCGGTGCTTTTAAGGGTGTCTAATTTTCAA 
AAGAGACGTCTGGGAGTATTTTGCTCTGGGCGTTTGGAGCAACTTCGCGGACAGCGGAGCTCG 
CCCAGCATGGATGTTCCAGGTTCACAGGCGCCTTTCTTCTGAGAACGACCCTGGCCTTGAACG 
TCAGAGCCGGGGACGAAGGCCCCCGGAGGCTGCTGCGAGCTCCGCGCGTTCCTTCGCGCCCTT 
CCGCGCCGCTCGCGCCGGCGCCGGCCTCCACCCCCGCGCGCCGCCTCCCACCAGTCCCGATGC 
AGGCGCCCGGCCGGGGGCCACTCGGGCTGCGGCTGATGATGCCCGGGCGCCGGGGGGCGCTGC 
GCGAGCCTGGCGGCTGCGGATCCTGCCTGGGGGTGGCGCTGGCCCTGCTGTTGCTGCTACTGC 
CCGCCTGCTGCCCCGTGCGGGCGCAGAACGACACGGAGCCCATCGTGCTGGAGGGCAAGTGCC 
TGGTGGTGTGCGACTCCAGCCCGTCGGCGGACGGCGCCGTCACCTCCTCCCTAGGCATCTCCG 
TGCGCTCCGGCAGCGCCAAGGTGGCCTTCTCCGCCACGCGGAGCACCAACCACGAGCCGTCCG 
AGATGAGCAACCGCACCATGACCATCTATTTCGACCAGGTATTAGTAAATATTGGCAACCACT 
TTGATCTTGCTTCCAGTATATTTGTAGCACCGAGAAAAGGGATTTATAGCTTCAGCTTCCACG 
TGGTCAAAGTGTATAACAGACAAACCATCCAGGTCAGTTTAATGCAGAATGGCTACCCAGTGA 
TCTCGGCCTTTGCAGGAGACCAGGATGTCACCAGAGAAGCTGCTAGCAATGGCGTGCTGCTGC 
TCATGGAAAGGGAAGACAAAGTGCATCTCAAACTTGAGAGAGGCAACCTCATGGGGGGCTGGA 
AATACTCCACATTCTCGGGCTTCTTGGTGTTTCCTCTATAAACACAGAGCCCCCTAGATGGTG 
GGGGAATGGCAAACTGGACCCAGGACTCCGCCCTTTAAAACACCCTGAACTTACTGGAATTGG 
ACACCTTGTTTCCAACCTCCGTCAGACTGTTGCAGTAGAAGAATGATTTCCTTTGAAACCTCC 
AGTACTTTTGTTTTTGTTTTTTGGAATACTGACAATTCCTCGGGAACCTGGCCTCTAATTAGT 
TTTAGATGACAAGGTCTTAAGGAGAAATGAAATTATCGATTTGAGCAATTTGTACCTGTGATT 
GTAAAGTCAATATCGGATTTTATTGTTGGGACCATGGACCTCTTTTGTTTGTATGTTGTATTG 
TCGTCCCAACGGAAGGAGAGCTCCTGACTCCAGGATGGGCTGCAGGTTGCAGTCAGGGCTTGA 
AGTAGGAGCCCAGCAAAGAACCACCTGCTGGACAGTCCTTGACATGTGTTCTGTGTGTGTCTG 
TATAGCCTTAAGAAAAAGAATGGCTTCACTTTCATTCTGTATTCTTCCCCCCACCATGTGGCT 
GGGAGGACTTGGGAGGGGGATGGGGACATTGGGAACCTGTCAAGAAGTGCTTTATCCAGAGAA 
GCAAATTTTGCACGATTGGACTGCAATTTTTGTTTTGTATTGTTTGTGTTTTTTCTTGAAAAG 
CTTTACTTTTCTTTCCACACTCAGCTCTCCCTCCTCAACCCCACTTTTATTTTTCTTGCTGGG 
GTTGAGGAGAGAAAATATAGAATTCCTGGATAAGACCAAACAAAACAAAACATTAAAATACCT 
GTATGTTTTGTTTTAGACGAGACCAAACTAAACAAAAAGTATCTGTTTATCAAAGTAAAAGTA 
ACACAATGGACAATTCTGCTTATTCTCTCAAAGAGATTCTAAGATGCACCTTTAGAACTATTA 
ATAGCAACCTGCATTTTTTTTTAATTTATACTTCAGAATCCTTTAAGAACCTGGTGTTCCTGA 
GTGGTCCTGAATCATATAAGTTGGTAATGGAAGCTGTAATGACCAAGTCCCCTAAACATACTA 
TGTCTTTGCCACGTGTGCTGTGACTTCTCTGTGGGTGATTTAATTTATTTGGATCCACCTCTG 
AGTGAGCGCACAGTGATCAGGTGCTTCAAAGCCAACAGACCAGCTCCTCTTCCTCCGGATCCT 
CTTTTGATCTGCCCAGGAAAGGGATGCATTGACACTCTCCTGCATGC^CCTGGCGAGAAGCCA 
CCTGAAAGTCACTGTGGTTAAAGATATTGGTGGAGGTACCCCAGGAGCACTGTTACAAATCCT
TCTTGTTTTGGCATCTCGTACAACATTATTAAGACACAGCTGAGAGTTGATGGGTGTGTAATG 
CATATGCCAAGGAAATGTCACTAATCCCAAAGCAATCAAAAAGGAGACCTCAAACCAGATGTT 
AATTTGTTCTTTGTGTAACAATGTAACCAAAATATTGATGATAAAAGTCATAATTTAAGATTC 
AGAATAAATGGGTTTGATGTCTGGCAAAAAAAAAAAAAAAA 
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FIGURE 86 

MQAPGRGPLGLRLMMPGRRGALREPGGCGSCLGVALALLLLLLPACCPVRAQNDTEPIVLEGK 
CLVVCDSSPSADGAVTSSLGISVRSGSAKVAFSATRSTNHEPSEMSNRTMTIYFDQVLVNIGN
HFDLASSIFVAPRKGIYSFSFHVVKVYNRQTIQVSLMQNGYPVISAFAGDQDVTREAASNGVL
LLMEREDKVHLKLERGNLMGGWKYSTFSGFLVFPL 

Important features of the protein: 
Signal peptide: 

amino acids 1-48

N-glycosylation sites. 

amino acids 53-57, 110-114 

N-myristoylation sites.

amino acids 26-32, 27-33, 29-35, 33-39, 76-82, 205-211 

Amidation site. 

amino acids 16-20 

C1q domain signature.

amino acids 117-148 



C1q domain proteins.

amino acids 115-149 



WO 01/40466 



PCT/US00/32678




FIGURE 87 

AGGGCCCGCGGGTGGAGAGAGCGACGCCCGAGGGGATGGCGGCAGCGTCCCGGAGCGCCTCTG

GCTGGGCGCTACTGCTGCTGGTGGCACTTTGGCAGCAGCGCGCGGCCGGCTCCGGCGTCTTCC 

AGCTGCAGCTGCAGGAGTTCATCAACGAGCGCGGCGTACTGGCCAGTGGGCGGCCTTGCGAGC 

CCGGCTGCCGGACTTTCTTCCGCGTCTGCCTTAAGCACTTCCAGGCGGTCGTCTCGCCCGGAC 

CCTGCACCTTCGGGACCGTCTCCACGCCGGTATTGGGCACCAACTCCTTCGCTGTCCGGGACG 

ACAGTAGCGGCGGGGGGCGCAACCCTCTCCAACTGCCCTTCAATTTCACCTGGCCGGGTACCT 

TCTCGCTCATCATCGAAGCTTGGCACGCGCCAGGAGACGACCTGCGGCCAGAGGCCTTGCCAC 

CAGATGCACTCATCAGCAAGATCGCCATCCAGGGCTCCCTAGCTGTGGGTCAGAACTGGTTAT 

TGGATGAGCAAACCAGCACCCTCACAAGGCTGCGCTACTCTTACCGGGTCATCTGCAGTGACA 

ACTACTATGGAGACAACTGCTCCCGCCTGTGCAAGAAGCGCAATGACCACTTCGGCCACTATG 

TGTGCCAGCCAGATGGCAACTTGTCCTGCCTGCCCGGTTGGACTGGGGAATATTGCCAACAGC

CTATCTGTCTTTCGGGCTGTCATGAACAGAATGGCTACTGCAGCAAGCCAGCAGAGTGCCTCT 

GCCGCCCAGGCTGGCAGGGCCGGCTGTGTAACGAATGCATCCCCCACAATGGCTGTCGCCACG 

GCACCTGCAGCACTCCCTGGCAATGTACTTGTGATGAGGGCTGGGGAGGCCTGTTTTGTGACC 

AAGATCTCAACTACTGCACCCACCACTCCCCATGCAAGAATGGGGCAACGTGCTCCAACAGTG 

GGCAGCGAAGCTACACCTGCACCTGTCGCCCAGGCTACACTGGTGTGGACTGTGAGCTGGAGC 

TCAGCGAGTGTGACAGCAACCCCTGTCGCAATGGAGGCAGCTGTAAGGACCAGGAGGATGGCT 

ACCACTGCCTGTGTCCTCCGGGCTACTATGGCCTGCACTGTGAACACAGCACCTTGAGCTGCG 

CCGACTCCCCCTGCTTCAATGGGGGCTCCTGCCGGGAGCGCAACCAGGGGGCCAACTATGCTT 

GTGAATGTCCCCCCAACTTCACCGGCTCCAACTGCGAGAAGAAAGTGGACAGGTGCACCAGCA 

ACCCCTGTGCCAACGGGGGACAGTGCCTGAACCGAGGTCCAAGCCGCATGTGCCGCTGCCGTC 

CTGGATTCACGGGCACCTACTGTGAACTCCACGTCAGCGACTGTGCCCGTAACCCTTGCGCCC 

ACGGTGGCACTTGCCATGACCTGGAGAATGGGCTCATGTGCACCTGCCCTGCCGGCTTCTCTG

GCCGACGCTGTGAGGTGCGGACATCCATCGATGCCTGTGCCTCGAGTCCCTGCTTCAACAGGG 

CCACCTGCTACACCGACCTCTCCACAGACACCTTTGTGTGCAACTGCCCTTATGGCTTTGTGG 

GCAGCCGCTGCGAGTTCCCCGTGGGCTTGCCGCCCAGCTTCCCCTGGGTGGCCGTCTCGCTGG 

GTGTGGGGCTGGCAGTGCTGCTGGTACTGCTGGGCATGGTGGCAGTGGCTGTGCGGCAGCTGC

GGCTTCGACGGCCGGACGACGGCAGCAGGGAAGCCATGAACAACTTGTCGGACTTCCAGAAGG 

ACAACCTGATTCCTGCGGCCCAGCTTAAAAACAGAAACCAGAAGAAGGAGCTGGAAGTGGACT 

GTGGCCTGGACAAGTCCAACTGTGGCAAACAGCAAAACCACACATTGGACTATAATCTGGCCC 

CAGGGCCCCTGGGGCGGGGGACCATGCCAGGAAAGTTTCCCCACAGTGACAAGAGCTTAGGAG 

AGAAGGCGCCACTGCGGTTACACAGTGAAAAGCCAGAGTGTCGGATATCAGCGATATGCTCCC 

CCAGGGACTCCATGTACCAGTCTGTGTGTTTGATATCAGAGGAGAGGAATGAATGTGTCATTG 

CCACGGAGGTATAAGGCAGGAGCCTACCTGGACATCCCTGCTCAGCCCCGCGGCTGGACCTTC 

CTTCTGCATTGTTTACA 
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FIGURE 88 

MAAASRSASGWALLLLVALWQQRAAGSGVFQLQLQEFINERGVLASGRPCEPGCRTFFRVCLK 
HFQAVVSPGPCTFGTVSTPVLGTNSFAVRDDSSGGGRNPLQLPFNFTWPGTFSLIIEAWHAPG
DDLRPEALPPDALISKIAIQGSLAVGQNWLLDEQTSTLTRLRYSYRVICSDNYYGDNCSRLCK 
KRNDHFGHYVCQPDGNLSCLPGWTGEYCQQPICLSGCHEQNGYCSKPAECLCRPGWQGRLCNE 
CIPHNGCRHGTCSTPWQCTCDEGWGGLFCDQDLNYCTHHSPCKNGATCSNSGQRSYTCTCRPG 
YTGVDCELELSECDSNPCRNGGSCKDQEDGYHCLCPPGYYGLHCEHSTLSCADSPCFNGGSCR 
ERNQGANYACECPPNFTGSNCEKKVDRCTSNPCANGGQCLNRGPSRMCRCRPGFTGTYCELHV 
SDCARNPCAHGGTCHDLENGLMCTCPAGFSGRRCEVRTSIDACASSPCFNRATCYTDLSTDTF 
VCNCPYGFVGSRCEFPVGLPPSFPWVAVSLGVGLAVLLVLLGMVAVAVRQLRLRRPDDGSREA 
MNNLSDFQKDNLIPAAQLKNTNQKKELEVDCGLDKSNCGKQQNHTLDYNLAPGPLGRGTMPGK 
FPHSDKSLGEKAPLRLHSEKPECRISAICSPRDSMYQSVCLISEERNECVIATEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-26

Transmembrane domain:

amino acids 530-552

N-glycosylation sites.

amino acids 108-112, 183-187, 205-209, 393-397, 570-574, 610-614

Glycosaminoglycan attachment site.

amino acids 96-100

Tyrosine kinase phosphorylation site.

amino acids 340-347

N-myristoylation sites.

amino acids 42-48, 204-210, 258-264, 277-283, 297-303, 383-389,
415-421, 461-467, 522-528, 535-541, 563-569, 599-605, 625-631

Amidation site.

amino acids 471-475

Aspartic acid and asparagine hydroxylation site.

amino acids 339-351

EGF-like domain cysteine pattern signature.

amino acids 173-185, 206-218, 239-251, 270-282, 310-322, 348-360,
388-400, 426-438, 464-476, 506-518

Calcium-binding EGF-like:

amino acids 224-245, 255-276, 295-316, 333-354, 373-394, 411-432, 
449-470 




FIGURE 89 

GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGAC 
TTGACTCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGC 
CGCCCAGTCCCGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCC 
TCCTTTTCATTCATAACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCC 
GAGCGTGGAAGAATGGGGTTCCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATT 
CAAGCTTTCCCCAAACCTGGAGGAAGCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCA 
GAAAGACCTTTGAATGAACAGATTGCTGAAGCAGAAGAAGACAAGATTAAAAAAACATATCCT 
CCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTTTGTTGATAACTTGAACCTGCTAAAGGCA 
ATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTATAAGAAGCTCCCCACTTGATAAT 
AAGTTGAATGTGGAAGATGTTGATTCAACCAAGAATCGAAAACTGATCGATGATTATGACTCT 
ACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCATCAACTAGACGGG 
ACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAGAAAATGAC 
AGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGCCAA 
GCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAAC 
AATTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAAATACCA 
GAGAAAGTGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACA 
GTATCTAACACATTAACCTTGACAAATGGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGAC 
AACTTTGAGGAACTCCAATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAA 
AAAGAAGCAAAAGAGAAAGAAACACTGATTACTATCATGAAAACACTGATTGACTTTGTGAAG 
ATGATGGTGAAATATGGAACAATATCTCCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGAT 
GAAATGATTGCTCTTCAGACCAAAAACAAGCTAGAAAAAAATGCTACTGACAATATAAGCAAG 
CTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAACAGACAGTACCAAGGAAGAAGCAGCT 
AAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAAGATGATAACTCCAACCCAGGA 
GGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGCCATCAGAAAAAATATT 
GAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTTCAAAGATGAGA 
GACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAAGAAGCC 
GAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAAC 
TGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCC 
AAGGGTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGC 
TTTCTTCCCGTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAA 
AAA 
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FIGURE 90 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENK 
PGQSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSG 
LDHKFQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTL 
EDEVAEVLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNT 
LTLTNGLERRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVK 
YGTISPEEGVSYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEK 
EYGSLKDSTKDDNSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFIN 
KQADAYVEKGILDKEEAEAIKRIYSSL

Important features: 
N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217-220,
265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368-371,
369-372, 382-385, 386-389, 387-390

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 91 

TGCATCAGTGCCCAGGCAAGCCCAGGAGTTGACATTTCTCTGCCCAGCCATGGGCCTCACCCT 
GCTCTTGCTGCTGCTCCTGGGACTAGAAGGTCAGGGCATAGTTGGCAGCCTCCCTGAGGTGCT 
GCAGGCACCCGTGGGAAGCTCCATTCTGGTGCAGTGCCACTACAGGCTCCAGGATGTCAAAGC 
TCAGAAGGTGTGGTGCCGGTTCTTGCCGGAGGGGTGCCAGCCCCTGGTGTCCTCAGCTGTGGA 
TCGCAGAGCTCCAGCGGGCAGGCGTACGTTTCTCACAGACCTGGGTGGGGGCCTGCTGCAGGT 
GGAAATGGTTACCCTGCAGGAAGAGGATGCTGGCGAGTATGGCTGCATGGTGGATGGGGCCAG 
GGGGCCCCAGATTTTGCACAGAGTCTCTCTGAACATACTGCCCCCAGAGGAAGAAGAAGAGAC 
CCATAAGATTGGCAGTCTGGCTGAGAACGCATTCTCAGACCCTGCAGGCAGTGCCAACCCTTT 
GGAACCCAGCCAGGATGAGAAGAGCATCCCCTTGATCTGGGGTGCTGTGCTCCTGGTAGGTCT 
GCTGGTGGCAGCGGTGGTGCTGTTTGCTGTGATGGCCAAGAGGAAACAAGAATCCCTCCTCAG 
TGGTCCACCACGTCAGTGACTCTGGACCGGCTGCTGAATTGCCTTTGGATGTACCACACATTA 
GGCTTGACTCACCACCTTCATTTGACAATACCACCTACACCAGCCTACCTCTTGATTCCCCAT 
CAGGAAAACCTTCACTCCCAGCTCCATCCTCATTGCCCCCTCTACCTCCTAAGGTCCTGGTCT 
GCTCCAAGCCTGTGACATATGCCACAGTAATCTTCCCGGGAGGGAACAAGGGTGGAGGGACCT 
CGTGTGGGCCAGCCCAGAATCCACCTAACAATCAGACTCCATCCAGCTAAGCTGCTCATCACA 
CTTTAAACTCATGAGGACCATCCCTAGGGGTTCTGTGCATCCATCCAGCCAGCTCATGCCCTA 
GGATCCTTAGGATATCTGAGCAACCAGGGACTTTAAGATCTAATCCAATGTCCTAACTTTACT 
AGGGAAAGTGACGCTCAGACATGACTGAGATGTCTTGGGGAAGACCTCCCTGCACCCAACTCC 
CCCACTGGTTCTTCTACCATTACACACTGGGCTAAATAAACCCTAATAATGATGTGCAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 




FIGURE 92 

MGLTLLLLLLLGLEGQGIVGSLPEVLQAPVGSSILVQCHYRLQDVKAQKVWCRFLPEGCQPLV 
SSAVDRRAPAGRRTFLTDLGGGLLQVEMVTLQEEDAGEYGCMVDGARGPQILHRVSLNILPPE 
EEEETHKIGSLAENAFSDPAGSANPLEPSQDEKSIPLIWGAVLLVGLLVAAVVLFAVMAKRKQ 
ESLLSGPPRQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 161-181 

N-myristoylation sites. 

amino acids 17-23, 172-178 

Amidation site. 

amino acids 73-79
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FIGURE 93 

GGCGGCGTTGCCGGGCTCTCCGGAAGGAGACGTGGCGGCGGTTGGGCCGGTGATACCCGGGCG 
CTTTATAGTCCCGCCGCCTCCTCCTCCACCTCCTCCTCCTCCTCCTCTCCTCCTGGGGCAGAG 
GAGGTTGTGGCGGTGGCTGGAGAAAGCGGCGGCGGAGGATGGAGGAAGGAGGCGGCGGCGTAC 
GGAGTCTGGTCCCGGGCGGGCCGGTGTTACTGGTCCTCTGCGGCCTCCTGGAGGCGTCCGGCG 
GCGGCCGAGCCCTTCCTCAACTCAGCGATGACATCCCTTTCCGAGTCAACTGGCCCGGCACCG 
AGTTCTCTCTGCCCACAACTGGAGTTTTATATAAAGAAGATAATTATGTCATCATGACAACTG 
CACATAAAGAAAAATATAAATGCATACTTCCCCTTGTGACAAGTGGGGATGAGGAAGAAGAAA 
AGGATTATAAAGGCCCTAATCCAAGAGAGCTTTTGGAGCCACTATTTAAACAAAGCAGTTGTT 
CCTACAGAATTGAGTCTTATTGGACTTACGAAGTATGTCATGGAAAACACATTCGGCAGTACC 
ATGAAGAGAAAGAAACTGGTCAGAAAATAAATATTCACGAGTACTACCTTGGGAATATGTTGG 
CCAAGAACCTTCTATTTGAAAAAGAACGAGAAGCAGAAGAAAAGGAAAAATCTAATGAGATTC
CCACTAAAAATATCGAAGGTCAGATGACACCATACTATCCTGTGGGAATGGGAAATGGTACAC 
CTTGTAGTTTGAAACAGAACCGGCCCAGATCAAGTACTGTGATGTACATATGTCATCCTGAAT 
CTAAGCATGAAATTCTTTCAGTAGCTGAAGTTACAACTTGTGAATATGAAGTTGTCATTTTGA 
CACCACTCTTGTGCAGTCATCCTAAATATAGGTTCAGAGCATCTCCTGTGAATGACATATTTT 
GTCAATCACTGCCAGGATCTCCATTTAAGCCCCTCACCCTGAGGCAGCTGGAGCAGCAGGAAG 
AAATACTAAGGGTGCCTTTTAGGAGAAATAAAGAGGGTGTCGGTTGGTGGAAATATGAATTCT 
GCTATGGCAAACATGTACATCAATACCATGAGGACAAGGATAGTGGGAAAACCTCTGTGGTTG 
TCGGGACATGGAACCAAGAAGAGCATATTGAATGGGCTAAGAAGAATACTGCTAGAGCTTATC 
ATCTTCAAGACGATGGTACCCAGACAGTCAGGATGGTGTCACATTTTTATGGAAATGGAGATA 
TTTGTGATATAACTGACAAACCAAGACAGGTGACTGTAAAACTAAAGTGCAAAGAATCAGATT 
CACCTCATGCTGTTACTGTATATATGCTAGAGCCTCACTCCTGTCAATATATTCTTGGGGTTG 
AATCTCCAGTGATCTGTAAAATCTTAGATACAGCAGATGAAAATGGACTTCTTTCTCTCCCCA 
ACTAAAGGATATTAAAGTTAGGGGAAAGAAAAGATCATTGAAAGTCATGATAATTTCTGTCCC 
ACTGTGTCTCATTATAGAGTTCTCAGCCATTGGACCTCTTCTAAAGGATGGTATAAAATGACT 
CTCAACCACTTTGTGAATACATATGTGTATATAAGAGGTTATTGATAAACTTCTGAGGCAGAC 
ATTTGTCTCGCTTTTTTTCATTTTTGTTGTGTCTTATAAACTGACTGTTTTTCTTTGCTTGGA 
TACTGTGATTCCAAAATAAATCTCATCCAAGCAAGTTAGAGTCCAGCCTAATCAAATGTCATA 
ATTGTTGTACCTATTGAAAGTTTTTAAATAATAGATTTATTATGTAAATTATAGTATATGTAA 
GTAGCTAATGAAGTAAAGATCATGAAGAAAGAAATTGATAGGTGTAAATGAGAGACCATGTAA 
AATATGTAAATTCTAGTACCTGAAATCCTTTCAACAGATTTTTATATAGCAACTGCTCTCTGC 
AAGTAGTTAAACTAGAAACTGGGCACATGGTAGAGGCTCACATGGGAGTTGTCCTCACCCTTG 
TTAATCTCAAGAAACTCTTATTTATAATAGGTTGCTTCTCTCTCAGAACTTTTATCTATTACT 
TTTTTCTTCTTATGAGTATGTTTACTCTCAGAGTATCTATCTGATGTAGACAGTTGGTGATGC 
TTCTGAGACTCAGAATGGTTTACTCTAACAAAACACTGTGCTGTCTATCCCTTGTACTTGCCT 
ACTGTAATATGGATTTCACTTCTGAACAGTTTACAGCACAATATTTATTTTAAAGTGAATAAA 
ATGTCCACAAGCAAAAA 
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FIGURE 94 

MEEGGGGVRSLVPGGPVLLVLCGLLEASGGGRALPQLSDDIPFRVNWPGTEFSLPTTGVLYKE 
DNYVIMTTAHKEKYKCILPLVTSGDEEEEKDYKGPNPRELLEPLFKQSSCSYRIESYWTYEVC 
HGKHIRQYHEEKETGQKINIHEYYLGNMLAKNLLFEKEREAEEKEKSNEIPTKNIEGQMTPYY
PVGMGNGTPCSLKQNRPRSSTVMYICHPESKHEILSVAEVTTCEYEVVILTPLLCSHPKYRFR
ASPVNDIFCQSLPGSPFKPLTLRQLEQQEEILRVPFRRNKEGVGWWKYEFCYGKHVHQYHEDK 
DSGKTSVVVGTWNQEEHIEWAKKNTARAYHLQDDGTQTVRMVSHFYGNGDICDITDKPRQVTV
KLKCKESDSPHAVTVYMLEPHSCQYILGVESPVICKILDTADENGLLSLPN 

Important features of the protein: 
Signal peptide: 

amino acids 1-30 

Glycosaminoglycan attachment site. 

amino acids 28-32 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 337-341 

N-myristoylation sites. 

amino acids 6-12, 23-29, 29-35, 49-55, 141-147, 152-158, 192-198, 
196-202 

Gram-positive cocci surface proteins 'anchoring' hexapeptide.

amino acids 54-60 
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FIGURE 95 

TTCCGTTTCTGGGAGGAGTGAGGGGCAACGGGTCGGAGAAAAAGGAAAAAAGAAGGGCTCAGC 
GCCTCCCCGCCGGGCCGTGGACAGAGGGGCACAGTTTCGGCAGGCGGGTGAGGTCGCTGAGGG 
CCCGCCGGAGATGTTTTCCTTGTCGAGCACGGTGCAACCCCAGGTTACAGTTCCTCTGAGTCA 
TCTCATCAATGCCTTCCATACACCAAAAAACACTTCTGTTTCTCTCAGTGGAGTGTCAGTTTC 
TCAAAACCAGCATCGAGATGTAGTTCCTGAGCATGAGGCTCCCAGCAGTGAGCCTTCACTTAA 
CTTAAGGGACCTTGGATTATCTGAACTAAAAATTGGACAGATTGATCAGCTGGTAGAAAATCT 
ACTTCCTGGATTTTGTAAAGGCAAAAACATTTCTTCCCATTGGCATACATCCCATGTCTCTGC 
ACAATCCTTCTTTGAAAATAAATATGGTAACTTAGATATATTTAGTACATTACGTTCCTCTTG 
CTTGTATCGACATCATTCAAGAGCTCTTCAAAGCATTTGTTCAGATCTTCAGTACTGGCCAGT 
TTTCATACAGTCTCGGGGTTTTAAAACTTTGAAATCAAGGACACGACGTCTCCAGTCTACCTC 
CGAGAGATTAGCTGAAACACAGAATATAGCGCCATCATTCGTGAAGGGGTTTCTTTTGCGGGA 
CAGAGGATCAGATGTTGAGAGTTTGGACAAACTCATGAAAACCAAAAATATACCTGAAGCTCA 
CCAAGATGCATTTAAAACTGGTTTTGCGGAAGGTTTTCTGAAAGCTCAAGCACTCACACAAAA
AACCAATGATTCCCTAAGGCGAACCCGTCTGATTCTCTTCGTTCTGCTGCTATTCGGCATTTA 
TGGACTTCTAAAAAACCCATTTTTATCTGTCCGCTTCCGGACAACAACAGGGCTTGATTCTGC 
AGTAGATCCTGTCCAGATGAAAAATGTCACCTTTGAACATGTTAAAGGGGTGGAGGAAGCTAA 
ACAAGAATTACAGGAAGTTGTTGAATTCTTGAAAAATCCACAAAAATTTACTATTCTTGGAGG 
TAAACTTCCAAAAGGAATTCTTTTAGTTGGACCCCCAGGGACTGGAAAGACACTTCTTGCCCG 
AGCTGTGGCGGGAGAAGCTGATGTTCCTTTTTATTATGCTTCTGGATCCGAATTTGATGAGAT
GTTTGTGGGTGTGGGAGCCAGCCGTATCAGAAATCTTTTTAGGGAAGCAAAGGCGAATGCTCC
TTGTGTTATATTTATTGATGAATTAGATTCTGTTGGTGGGAAGAGAATTGAATCTCCAATGCA 
TCCATATTCAAGGCAGACCATAAATCAACTTCTTGCTGAAATGGATGGTTTTAAACCCAATGA 
AGGAGTTATCATAATAGGAGCCACAAACTTCCCAGAGGCATTAGATAATGCCTTAATACGTCC
TGGTCGTTTTGACATGCAAGTTACAGTTCCAAGGCCAGATGTAAAAGGTCGAACAGAAATTTT 
GAAATGGTATCTCAATAAAATAAAGTTTGATCAATCCGTTGATCCAGAAATTATAGCTCGAGG 
TACTGTTGGCTTTTCCGGAGCAGAGTTGGAGAATCTTGTGAACCAGGCTGCATTAAAAGCAGC 
TGTTGATGGAAAAGAAATGGTTACCATGAAGGAGCTGGAGTTTTCCAAAGACAAAATTCTAAT 
GGGGCCTGAAAGAAGAAGTGTGGAAATTGATAACAAAAACAAAACCATCACAGCATATCATGA 
ATCTGGTCATGCCATTATTGCATATTACACAAAAGATGCAATGCCTATCAACAAAGCTACAAT 
CATGCCACGGGGGCCAACACTTGGACATGTGTCCCTGTTACCTGAGAATGACAGATGGAATGA 
AACTAGAGCCCAGCTGCTTGCACAAATGGATGTTAGTATGGGAGGAAGAGTGGCAGAGGAGCT 
TATATTTGGAACCGACCATATTACAACAGGTGCTTCCAGTGATTTTGATAATGCCACTAAAAT 
AGCAAAGCGGATGGTTACCAAATTTGGAATGAGTGAAAAGCTTGGAGTTATGACCTACAGTGA
TACAGGGAAACTAAGTCCAGAAACCCAATCTGCCATCGAACAAGAAATAAGAATCCTTCTAAG 
GGACTCATATGAACGAGCAAAACATATCTTGAAAACTCATGCAAAGGAGCATAAGAATCTCGC 
AGAAGCTTTATTGACCTATGAGACTTTGGATGCCAAAGAGATTCAAATTGTTCTTGAGGGGAA 
AAAGTTGGAAGTGAGATGATAACTCTCTTGATATGGATGCTTGCTGGTTTTATTGCAAGAATA 
TAAGTAGCATTGCAGTAGTCTACTTTTACAACGCTTTCCCCTCATTCTTGATGTGGTGTAATT 
GAAGGGTGTGAAATGCTTTGTCAATCATTTGTCACATTTATCCAGTTTGGGTTATTCTCATTA 
TGACACCTATTGCAAATTAGCATCCCATGGCAAATATATTTTGAAAAAATAAAGAACTATCAG 
GATTGAAAACAAAAAAAAAAAA
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FIGURE 96 

MFSLSSTVQPQVTVPLSHLINAFHTPKNTSVSLSGVSVSQNQHRDVVPEHEAPSSEPSLNLRD 
LGLSELKIGQIDQLVENLLPGFCKGKNISSHWHTSHVSAQSFFENKYGNLDIFSTLRSSCLYR 
HHSRALQSICSDLQYWPVFIQSRGFKTLKSRTRRLQSTSERLAETQNIAPSFVKGFLLRDRGS 
DVESLDKLMKTKNIPEAHQDAFKTGFAEGFLKAQALTQKTNDSLRRTRLILFVLLLFGIYGLL 
KNPFLSVRFRTTTGLDSAVDPVQMKNVTFEHVKGVEEAKQELQEVVEFLKNPQKFTILGGKLP 
KGILLVGPPGTGKTLLARAVAGEADVPFYYASGSEFDEMFVGVGASRIRNLFREAKANAPCVI 
FIDELDSVGGKRIESPMHPYSRQTINQLLAEMDGFKPNEGVIIIGATNFPEALDNALIRPGRF 
DMQVTVPRPDVKGRTEILKWYLNKIKFDQSVDPEIIARGTVGFSGAELENLVNQAALKAAVDG 
KEMVTMKELEFSKDKILMGPERRSVEIDNKNKTITAYHESGHAIIAYYTKDAMPINKATIMPR 
GPTLGHVSLLPENDRWNETRAQLLAQMDVSMGGRVAEELIFGTDHITTGASSDFDNATKIAKR 
MVTKFGMSEKLGVMTYSDTGKLSPETQSAIEQEIRILLRDSYERAKHILKTHAKEHKNLAEAL 
LTYETLDAKEIQIVLEGKKLEVR 

Important features of the protein: 
Transmembrane domain: 

amino acids 238-259 

N-glycosylation sites. 

amino acids 28-32, 90-94, 230-234, 278-282, 535-539, 584-588, 
623-627 

N-myristoylation sites. 

amino acids 35-41, 266-272, 286-292, 325-331, 357-363, 599-605 

Amidation site. 

amino acids 387-393, 709-713 

ATP/GTP-binding site motif A (P-loop).

amino acids 322-330 

AAA-protein family proteins 

amino acids 315-336, 343-386, 405-451 
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FIGURE 97 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGG 
GACCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAA 
ACAAGTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTT 
CCAGGCCTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACG 
ACCTGTGCCACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAG 
GCTTGGCAGTTTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGT
CTTGCCTTCAGCCTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAG 
ACACTCAATTTGGGAAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCT 
GAGATACGGGGCAGTGTGCAAGCCAAAGATGGAAACATTGACATCAGAATCTTAAGGAGGACT 
GAGTCTTTGCAAGACACAAAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTC 
TATCTGGACAGGGTATTTAAAAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGC 
AGCCTCGCCAATTCCTTTCTTACCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACA 
TGCCATTGTGGGGAGGAAGCAATGAAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTG 
GAACCTCAGGCAGCAGTTGTGAAGGCTTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAG 
GAGACAGAATAGGAGGAAAGTGATGCTGCTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCT 
TCAATACCTGCAGAGGAGGCATGACCCCAAACCACCATCTCTTTACTGTACTAGTCTTGTGCT 
GGTCACAGTGTATCTTATTTATGCATTACTTGCTTCCTTGCATGATTGTCTTTATGCATCCCC 
AATCTTAATTGAGACCATACTTGTATAAGATTTTTGTAATATCTTTCTGCTATTGGATATATT 
TATTAGTTAATATATTTATTTATTTTTTGCTATTTAATGTATTTATTTTTTTACTTGGACATG 
AAACTTTAAAAAAATTCACAGATTATATTTATAACCTGACTAGAGCAGGTGATGTATTTTTAT 
ACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGGCTAGGGGGGTTATTCATTTGTAT 
TCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGATATTTGAAATTGAACCAATGAC 
TACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCACATCTACCTTACAATTACTG 
ACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAGCCAGGAATCCTACACGG 
CCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 98 

MKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNGFSEIRGSVQAKDGNIDIR 
ILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKISSLANSFLTIKKDLRLC 
HAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEETE

Signal sequence: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 107-110, 140-143 

N-myristoylation site. 

amino acids 51-56 



Interleukin 10:

amino acids 9-176 
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FIGURE 99 

GCGCCGGCTCCGCGCCTCGCGCCCAGTCCGCGGGCCGCGCCGCCGCTCCCGCCGCTCCCGCCG 

CTCCCGCAGCCGCCCCGCCGCCCGCCCGGAGCCCCGCGTCCCTAGGCCTGGCTCCCGCCTGCC 

CGAGACCCGCCCAGCCTGCCCCGCTCAGCCGCCAGAGAAGATGCGGCTGCTCCCGGAATGGTT 

CCTCTTGCTCTTTGGCCCGTGGCTCCTTAGGAAGGCCGTCAGTGCCCAGATACCAGAGTCCGG 

AAGGCCGCAGTACCTGGGGCTGCGCCCCGCCGCGGCCGGAGCGGGTGCCCCCGGCCAGCAGCT 

CCCAGAGCCAAGGTCTTCGGACGGCCTAGGCGTGGGCCGCGCCTGGAGCTGGGCCTGGCCGAC 

CAACCACACGGGGGCGCTGGCCCGGGCAGGGGCAGCCGGGGCGTTGCCCGCGCAGCGCACCAA 

GAGGAAGCCGTCCATCAAGGCGGCGCGCGCCAAAAAGATCTTCGGCTGGGGGGACTTCTACTT 

TCGGGTGCATACCCTCAAGTTTTCGCTGCTGGTGACCGGCAAGATCGTGGACCATGTGAACGG 

TACCTTCAGTGTGTATTTCCGCCAGAACTCGTCCAGCCTGGGCAACCTCAGTGTCAGCATCGT 

GCCGCCCTCCAAGCGTGTCGAGTTCGGAGGAGTCTGGCTGCCCGGGCCTGTCCCCCACCCTCT 

GCAGTCTACGCTCGCCCTGGAGGGGGTGCTTCCTGGGCTGGGGCCCCCGCTGGGGATGGCAGC 

AGCAGCGGCGGGGCCGGGGCTTGGGGGCTCCCTCGGGGGCGCACTGGCGGGGCCGCTTGGGGG 

CGCGTTGGGAGTGCCTGGGGCCAAAGAGTCACGCGCTTTCAATTGCCACGTGGAGTATGAGAA 

GACAAACCGCGCGCGCAAGCACCGACCGTGCCTGTACGACCCGTCGCAGGTGTGTTTCACCGA 

GCACACGCAGAGCCAGGCCGCCTGGCTCTGTGCCAAGCCCTTCAAAGTCATCTGTATCTTCGT 

CTCTTTCCTCAGCTTTGACTACAAACTGGTGCAGAAGGTGTGCCCAGACTATAACTTCCAGAG 

TGAGCACCCCTACTTCGGA3CAGCGCCCCTCCCCAGCCAGTCCTGAGCCTCCCGCCAAATCCCA 

GCCTCACTAGGTGGGACCCCCTTCCCAGTGTTCTGCCGCTCCTGTGGCCATGTCGCCCACTCC 

TTCCACTCTGGGGGCGGAGGGGAATGGCTTCTCGGGACCCTCAGCTAGCGTGGGTGCCCTTTT 

CCTTATGCGGAGTGCCCGCAAGGCTGGGGTAGCCCCCTCCAGTACACCCCAAAGTGAAAGGGA 

TAAGAGTGCAGCCCCAGAATAGGCGGGGCTTGGAGGCGGTCCCAATGTCCCCTGGGTCCACAG 

TGGGTCCCCTTTTCACCCTTGGCGCTAGGCTGCGCACTCCCTTTCCCCGCAGCTTTAATAACT

CCTGGCCTGGCACCCTCACCCCACCCTGACTTTCCCATCCCCCAGCGCTTGTCCTGCTTCACC 

ATACCCCGCCTAAGACTGTAAAGGCCTAAAAACCTCGGCCTGTCCTCCCACCATTCTGCCTGC

CATATGCCTGTCCCCTTTTCCTCCAAACCCTATTAGGGTACCGGAAGCAGAACCCCTGGGCTG 

AGGCCCTGGCCCTGCCCCCGGCCCCTGCCCCTGCCCGCCCCCCTCCAGTCCAGGCAGTCGAGC 

TCCACCTGCCCTCTCCTGCTGCTTCCTCTCGGTGATATTTTTTCTACGCCAAAACAGACGGGA 

AAGGGAACAAAATAAAGTGAAATCCCAAAAAAAAAAAAAAAAAAAAAA7U^AAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAA 
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FIGURE 100 

MRLLPEWFLLLFGPWLLRKAVSAQIPESGRPQYLGLRPAAAGAGAPGQQLPEPRSSDGLGVGR 
AWSWAWPTNHTGALARAGAAGALPAQRTKRKPSIKAARAKKIFGWGDFYFRVHTLKFSLLVTG 
KIVDHVNGTFSVYFRHNSSSLGNLSVSIVPPSKRVEFGGVWLPGPVPHPLQSTLALEGVLPGL 
GPPLGMAAAAAGPGLGGSLGGALAGPLGGALGVPGAKESRAFNCHVEYEKTNRARKHRPCLYD 
PSQVCFTEHTQSQAAWLCAKPFKVICIFVSFLSFDYKLVQKVCPDYNFQSEHPYFG 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 273-288 

N-glycosylation sites. 

amino acids 72-76, 133-137, 143-147, 149-153 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 93-97 

N-myristoylation sites.

amino acids 35-41, 58-64, 60-66, 81-87, 84-90, 184-190, 194-200, 
203-209, 205-211, 206-212, 209-215, 217-223, 221-227, 224-230 

Cytochrome b/b6 Qo site signature. 

amino acids 5-11 
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FIGURE 101 

AATGCCCCATGCGCACCCCACAGCTCGCGCTCCTGCAAGTGTTCTTTCTGGTGTTCCCCGATG 
GCGTCCGGCCTCAGCCCTCTTCCTCCCCATCAGGGGCAGTGCCCACGTCTTTGGAGCTGCAGC 
GAGGGACGGATGGCGGAACCCTCCAGTCCCCTTCAGAGGCGACTGCAACTCGCCCGGCCGTGC 
CTGGACTCCCTACAGTGGTCCCTACTCTCGTGACTCCCTCGGCCCCTGGGAATAGGACTGTGG 
ACCTCTTCCCAGTCTTACCGATCTGTGTCTGTGACTTGACTCCTGGAGCCTGCGATATAAATT 
GCTGCTGCGACAGGGACTGCTATCTTCTCCATCCGAGGACAGTTTTCTCCTTCTGCCTTCCAG 
GCAGCGTAAGGTCTTCAAGCTGGGTTTGTGTAGACAACTCTGTTATCTTCAGGAGTAATTCCC 
CGTTTCCTTCAAGAGTTTTCATGGATTCTAATGGAATCAGGCAGTTTTGTGTCCATGTGAACA 
ACTCAAACTTAAACTATTTCCAGAAGCTTCAAAAGGTCAATGCAACCAACTTCCAGGCCCTGG 
CTGCAGAGTTTGGAGGCGAATCATTCACTTCAACATTCCAAACTCAATCACCACCATCTTTTT 
ACAGGGCTGGGGACCCCATTCTTACTTACTTCCCCAAGTGGTCTGTAATAAGCTTGCTGAGAC 
AACCTGCAGGAGTTGGAGCTGGGGGACTCTGTGCTGAAAGCAATCCTGCAGGTTTCCTAGAGA 
GTAAAAGTACAACTTGCACTCGTTTTTTCAAGAACCTGGCTAGTAGCTGTACCTTGGATTCAG 
CCCTCAATGCTGCCTCTTACTATAACTTCACAGTCTTAAAGGTTCCAAGAAGCATGACTGATC
CACAGAATATGGAGTTCCAGGTTCCTGTAATACTTACCTCACAGGCTAATGCTCCTCTGTTGG 
CTGGAAACACTTGTCAGAATGTAGTTTCTCAGGTCACCTATGAGATAGAGACCAATGGGACTT 
TTGGAATCCAGAAAGTTTCTGTCAGTTTGGGACAAACCAACCTGACTGTTGAGCCAGGCGCTT 
CCTTACAGCAACACTTCATCCTTCGCTTCAGGGCTTTTCAACAGAGCACAGCTGCTTCTCTCA 
CCAGTCCTAGAAGTGGGAATCCTGGCTATATAGTTGGGAAGCCACTCTTGGCTCTGACTGATG 
ATATAAGTTACTCAATGACCCTCTTACAGAGCCAGGGTAATGGAAGTTGCTCTGTTAAAAGAC 
ATGAAGTGCAGTTTGGAGTGAATGCAATATCTGGATGCAAGCTCAGGTTGAAGAAGGCAGACT 
GCAGCCACTTGCAGCAGGAGATTTATCAGACTCTTCATGGAAGGCCCAGACCAGAGTATGTTG 
CCATCTTTGGTAATGCTGACCCAGCCCAGAAAGGAGGGTGGACCAGGATCCTCAACAGGCACT 
GCAGCATTTCAGCTATAAACTGTACTTCCTGCTGTCTCATACCAGTTTCCCTGGAGATCCAGG 
TATTGTGGGCATATGTAGGTCTCCTGTCCAACCCGCAAGCTCATGTATCAGGAGTTCGATTCC 
TATACCAGTGCCAGTCTATACAGGATTCTCAGCAAGTTACAGAAGTATCTTTGACAACTCTTG 
TGAACTTTGTGGACATTACCCAGAAGCCACAGCCTCCAAGGGGCCAACCCAAAATGGACTGGA 
AATGGCCATTCGACTTCTTTCCCTTCAAAGTGGCATTCAGCAGAGGAGTATTCTCTCAAAAAT 
GCTCAGTCTCTCCCATCCTTATCCTGTGCCTCTTACTACTTGGAGTTCTCAACCTAGAGACTA 
TGTGAAGAAAAGAAAATAATCAGATTTCAGTTTTCCCTATGAGAAACTCTGAGGCAGCCACTT 
ATCTTGGCTAAATAGAACCTCACCTGCTCATGACCAGAGAGCATTTAGGATAATAGATGACCT 
AACTGAAGGAATCCTTGTATATGAAAGGAGTTATTTTAGAAAAGCAATAAAAATATTTTATTC 
ATCNTAAAAAAAAAA 
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FIGURE 102 

MRTPQLALLQVFFLVFPDGVRPQPSSSPSGAVPTSLELQRGTDGGTLQSPSEATATRPAVPGL 
PTVVPTLVTPSAPGNRTVDLFPVLPICVCDLTPGACDINCCCDRDCYLLHPRTVFSFCLPGSV
RSSSWVCVDNSVIFRSNSPFPSRVFMDSNGIRQFCVHVNNSNLNYFQKLQKVNATNFQALAAE 
FGGESFTSTFQTQSPPSFYRAGDPILTYFPKWSVISLLRQPAGVGAGGLCAESNPAGFLESKS 
TTCTRFFKNLASSCTLDSALNAASYYNFTVLKVPRSMTDPQNMEFQVPVILTSQANAPLLAGN 
TCQNVVSQVTYEIETNGTFGIQKVSVSLGQTNLTVEPGASLQQHFILRFRAFQQSTAASLTSP 
RSGNPGYIVGKPLLALTDDISYSMTLLQSQGNGSCSVKRHEVQFGVNAISGCKLRLKKADCSH 
LQQEIYQTLHGRPRPEYVAIFGNADPAQKGGWTRILNRHCSISAINCTSCCLIPVSLEIQVLW 
AYVGLLSNPQAHVSGVRFLYQCQSIQDSQQVTEVSLTTLVNFVDITQKPQPPRGQPKMDWKWP 
FDFFPFKVAFSRGVFSQKCSVSPILILCLLLLGVLNLETM 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 484-505, 581-600 

N-glycosylation sites. 

amino acids 78-82, 165-169, 179-185, 279-285, 331-337, 347-351, 
410-414, 487-491 

N-myristoylation sites.

amino acids 30-36, 41-47, 124-130, 232-238, 236-242, 409-415 

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 420-431 
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FIGURE 103

CCTAATTCTCAAGGTGATGCTATTTAGGAAGTCATAACTCATGTGAGTGGAGCCATGTGGGAT 

TAAGAAGTGATAGGAGAGCTTGCTGTCTGTCTCTGCTCTCCACTGTGTGAGGATACAACAGGA 

AGACAGCCATCTGGTGAGGAAGAGAGGGCCCTCGCCAGATACCGGACCTGCTGACACCTTGAT 

CTTGGACTTCCCATCTTCCAGGAAGGCCTGACCTCAGTTGTTCCAGGGTAAAGAATTTGGGCA 

GTGCCCACACCCACGCTGTTGGATAACATTTCTTCACCATACCAGTGAGGGTGAATGTGTACA 

CGCCCAGCTTCCTGCCTGTTACTCTCCACAGTATGCGAAGAATATCCCTGACTTCTAGCCCTG 

TGCGCCTTCTTTTGTTTCTGCTGTTGCTACTAATAGCCTTGGAGATCATGGTTGGTGGTCACT 

CTCTTTGCTTCAACTTCACTATAAAATCATTGTCCAGACCTGGACAGCCCTGGTGTGAAGCGC 

AGGTCTTCTTGAATAAAAATCTTTTCCTTCAGTACAACAGTGACAACAACATGGTCAAACCTC 

TGGGCCTCCTGGGGAAGAAGGTATATGCCACCAGCACTTGGGGAGAATTGACCCAAACGCTGG 

GAGAAGTGGGGCGAGACCTCAGGATGCTCCTTTGTGACATCAAACCCCAGATAAAGACCAGTG 

ATCCTTCCACTCTGCAAGTCGAGATGTTTTGTCAACGTGAAGCAGAACGGTGCACTGGTGCAT 

CCTGGCAGTTCGCCACCAATGGAGAGAAATCCCTCCTCTTTGACGCAATGAACATGACCTGGA 

CAGTAATTAATCATGAAGCCAGTAAGATCAAGGAGACATGGAAGAAAGACAGAGGGCTGGAAA 

AGTATTTCAGGAAGCTCTCAAAGGGAGACTGCGATCACTGGCTCAGGGAATTCTTAGGGCACT 

GGGAGGCAATGCCAGAACCGACAGGCAGAAGATCCACCIA6AGGTGATACCACGGCGGCGCAG 

AGTTGTTCACCTGTGGTCCTCGATCGCTGACAGCCTTGGCTCCCACTGCTGTGTGTTCCCTGA 

GTCAAGTGGAGGCGGAGCCTGCAATGAGCGGAGATCGCGCCTCTGCATTCCAGTCTTGGCAAC 

AGAGCAAGACTCCGTCTCAAAAAAAAAAAATTTTTTTTCAGTACATATTTTTTAAAAGATAGG 

GCTGGGCACAGCAGCTCACATCTATAATCCCAACACTTTGGGAGGCCTAGGCAGGAGGATCAC 

TTGAGCCCAGGAATCTGAAGCTGCAGTGAGCCTTTGCTCGTGAGATTGTGGACCTATGATCCT 

ACCACCAGCCCACCTGGTTCTAACACCCCCTCCTCTATGTGTGAGAGGGAGAGAAGAAAAGTG 

AGGGAGAAAAGAGAGATAAGCAAAGAACAGAGAGGAAAAATGGAAAATAAGAGGAAATTGGGG 

GAATTAAACAGAGGGGAGGGCATGGATCCCCGGGAGTTAGAAGAGTAGCAGCTTGTGGATTAC 

TACGCAGTGGAGGAAGAAGAGTTGTTGGAAATTATTTGAGAGGTAGTATAATCATTTGTGAGG 

CAGTTTTCTGCATTCACCATTTCTCACAGACTAAGTTACTCATAAGCAAACGTGCAATTCACA 

TTACACTGAAATTCTTCCCTAATACATCAT^TTGCATTGGAATAAAGTACGGTTTTCAAACAAC 

CTGATATAGCAGAACTGACTGTATAAATTATGTGAGCACAGTGCAAGTAATTCTTTGTTTGTT 

TGTTTGTTTTTTTGAGACAGAGTCTCACTCTATCTCCCAGGCTGGAGTGTAGTGGTGCGATCC 

CGGCTCACTGCAACCTCGATCTCCCAGGCTCAAGCGATTCCCCTGCCTCAGCCTCCTGAGTAG 

CTGGGATTACAGGCATGAGCCACCACGCCCGGCTAATTTTTGTATTTTTAGTAGAGACGGGGT 

TTCACCCTGTTGGCCAGGCTGGTCTCGAACTACGGACCTCAGGTGATCTGCCCCCCTCAGCCT 

CTCAAAGTGCTGGGATTATAGCATGAGCCACTGAGCCCAGACACAAGTAGTTCTTTCTGATAA 

ACACTTTAACACTGAATGCA 
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FIGURE 104 

MRRISLTSSPVRLLLFLLLLLIALEIMVGGHSLCFNFTIKSLSRPGQPWCEAQVFLNKNLFLQ 
YNSDNNMVKPLGLLGKKVYATSTWGELTQTLGEVGRDLRMLLCDIKPQIKTSDPSTLQVEMFC 
QREAERCTGASWQFATNGEKSLLFDAMNMTWTVINHEASKIKETWKKDRGLEKYFRKLSKGDC 

DHWLREFLGHWEAMPEPTGRRST 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain:

amino acids 11-30 (possible type II protein) 

N-glycosylation site. 

amino acids 36-39, 154-157 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 2-5, 182-185, 209-212 

Casein kinase II phosphorylation site. 

amino acids 86-89, 93-96, 142-145, 185-188 

N-myristoylation site.

amino acids 46-51 



Amidation site. 

amino acids 77-80, 207-210 
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FIGURE 105 

TTTTCCGAGTGACCTTCTTGATGCTGGCTGTTTCTCTCACCGTTCCCCTGCTTGGAGCCATGA

TGCTGCTGGAATCTCCTATAGATCCACAGCCTCTCAGCTTCAAAGAACCCCCGCTCTTGCTTG 

GTGTTCTGCATCCAAATACGAAGCTGCGACAGGCAGAAAGGCTGTTTGAAAATCAACTTGTTG 

GACCGGAGTCCATAGCACATATTGGGGATGTGATGTTTACTGGGACAGCAGATGGCCGGGTCG 

TAAAACTTGAAAATGGTGAAATAGAGACCATTGCCCGGTTTGGTTCGGGCCCTTGCAAAACCC 

GAGATGATGAGCCTGTGTGTGGGAGACCCCTGGGTATCCGTGCAGGGCCCAATGGGACTCTCT 

TTGTGGCCGATGCATACAAGGGACTATTTGAAGTAAATCCCTGGAAACGTGAAGTGAAACTGC 

TGCTGTCCTCCGAGACACCCATTGAGGGGAAGAACATGTCCTTTGTGAATGATCTTACAGTCA 

CTCAGGATGGGAGGAAGATTTATTTCACCGATTCTAGCAGCAAATGGCAAAGACGAGACTACC 

TGCTTCTGGTGATGGAGGGCACAGATGACGGGCGCCTGCTGGAGTATGATACTGTGACCAGGG 

AAGTAAAAGTTTTATTGGACCAGCTGCGGTTCCCGAATGGAGTCCAGCTGTCTCCTGCAGAAG 

ACTTTGTCCTGGTGGCAGAAACAACCATGGCCAGGATACGAAGAGTCTACGTTTCTGGCCTGA 

TGAAGGGCGGGGCTGATCTGTTTGTGGAGAACATGCCTGGATTTCCAGACAACATCCGGCCCA 

GCAGCTCTGGGGGGTACTGGGTGGGCATGTCGACCATCCGCCCTAACCCTGGGTTTTCCATGC 

TGGATTTCTTATCTGAGAGACCCTGGATTAAAAGGATGATTTTTAAGCTCTTTAGTCAAGAGA 

CGGTGATGAAGTTTGTGCCGCGGTACAGCCTCGTCCTAGAACTCAGCGACAGCGGTGCCTTCC 

GGAGAAGCCTGCATGATCCCGATGGGCTGGTGGCCACCTACATCAGCGAGGTGCACGAACACG 

ATGGGCACCTGTACCTGGGCTCTTTCAGGTCCCCCTTCCTCTGCAGACTCAGCCTCCAGGCTG 

TTSAgCCCTCCCAGATAGCTGCCCCTGCCACGCAGGCCAGGAGTCTTCACACTCAGGCACCAG 

GCCTGGTCCAGGAGGAGCTGTGGACACAGTCGTGGTTCAAGTGTCCACATGCACCTGTTAGTC 

CCTGAGAGGTGGTGGGAATGGCTGCTTCATTCCTCGAGGATGCCCGGGCCCCACCTGGGCTTG 

TCTTTCTGTTTAGAGGGAAGTGTAACATATCTGCCATGAGGAACATAAATTCATGTAAAGCCA 

TTTTCTCTTAAACAAAACAAAACTTTCTAAGTACAATCATTCTCTAGGATTTGGGAAGCTCCT 

TGCACTTGGAACAGGGCTCAGGTGGGTGGAGCAGTAAGGCACTACCCAGAGAGCTTGCTGCTG 

CGGCCCTGTCCTGCGGCCTCAAAGTTCTTCTTTACTATATATAACGTGCGGTCATACCTTTCT 

TCGTTGTGGTGGGGATGGAAGAGCAGAGGGAGCATGGCCCAGGGGTGTTGAGGCCAGCGGTGA 

GAGCCGTGTTAGCCAAGACATGGAACTGTGTTCTCAAGGGTTATGTGGGGCGTGGGCTCTCCA 

TAGTGTGTATGAAAAGCTTGTTGACTCTAGCGGCTCAGAGAGGACTTTGCTGGGTTTCTTTCT 

GTGAATATCTCCGTGCTGACCATGCTGGAATTGGATGATTCTGCAATTCGGGACCTACTGCAG 

GGGTCCGTTTAGTAACGTCTTGTCTGTGATCTTTGTTCTTGACCTCTAGACCCCAAGATGTGA 

ACAGTGCACGTGTTAATGTCATCTTTGCTCATGTGTTATAAGCCCCAAGTTGCTGTATATTTT 

CACAAGTATGTCTACACACTGG 
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FIGURE 106 

MLAVSLTVPLLGAMMLLESPIDPQPLSFKEPPLLLGVLHPNTKLRQAERLFENQLVGPESIAH 
IGDVMFTGTADGRVVKLENGEIETIARFGSGPCKTRDDEPVCGRPLGIRAGPNGTLFVADAYK 

GLFEVNPWKREVKLLLSSETPIEGKNMSFVNDLTVTQDGRKIYFTDSSSKWQRRDYLLLVMEG
TDDGRLLEYDTVTREVKVLLDQLRFPNGVQLSPAEDFVLVAETTMARIRRVYVSGLMKGGADL 
FVENMPGFPDNIRPSSSGGYWVGMSTIRPNPGFSMLDFLSERPWIKRMIFKLFSQETVMKFVP 
RYSLVLELSDSGAFRRSLHDPDGLVATYISEVHEHDGHLYLGSFRSPFLCRLSLQAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-13 

Transmembrane domain: 

amino acids 1-21 (possible type II) 

N-glycosylation sites. 

amino acids 116-119, 152-155 

Casein kinase II phosphorylation sites. 

amino acids 19-22, 27-30, 98-101, 146-149, 221-224, 286-289,
332-335

N-myristoylation sites. 

amino acids 71-76, 92-97, 189-194, 244-249, 338-343 

Amidation site. 

amino acids 164-167 
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FIGURE 107

agacgcgSggg^^



c 



CCTTGATTCTGCTCCACTTCAGAGTGACTCCAGACCCCACCAGGCCTCTTACTTTCCCCAA^ 



CAGCTGGATCAAGGAACTGCATTGGGCAGGA 

E¥MM==M 

TATTCCTTAAAA 
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FIGURE 108

MEFSWLETRWARPFYLAFVFCLALGLLQAIKLYLRRQRLLRDLRPFPAPPTHWFLGHQKFIQD 
DNMEKLEEIIEKYPRAFPFWIGPFQAFFCIYDPDYAKTLLSRTDPKSQYLQKFSPPLLGKGLA 
ALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVKMMLDKWEKICSTQDTSVEVYEHINSMSL 

DIIMKCAFSKETNCQTNSTHDPYAKAIFELSKIIFHRLYSLLYHSDIIFKLSPQGYRFQKLSR
VLNQYTDTIIQERKKSLQAGVKQDNTPKRKYQDFLDIVLSAKDESGSSFSDIDVHSEVSTFLL 
AGHDTLAASISWILYCLALNPEHQERCREEVRGILGDGSSITWDQLGEMSYTTMCIKETCRLI 
PAVPSISRDLSKPLTFPDGCTLPAGITWLSIWGLHHNPAVWKNPKVFDPLRFSQENSDQRHP 
YAYLPFSAGSRNCIGQEFAMIELKVTIALILLHFRVTPDPTRPLTFPNHFILKPKNGMYLHLK 

KLSEC 

Important features of the protein: 
Signal peptide: 

amino acids 1-29 

Transmembrane domains: 

amino acids 310-330, 397-413, 459-473 

N-glycosylation site. 

amino acids 206-210 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 265-269, 504-520 

N-myristoylation sites. 

amino acids 25-31, 298-304, 353-359, 450-456, 456-462 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 447-457 

Cytochrome P450 cysteine heme-iron ligand proteins.

amino acids 444-475 
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FIGURE 109

GGCGTTCCGGGCCTCAACTTTGGCGTCGTGAGATTCTTGTGAGGCGTCTGCCTGGAAGCCGGC 

AGCAATTTTGCTTCTTTAAAGAGAAAAAGAAGGCTAGGGACTCAGATTCCTGGATTCTGAGAT 

CCAGACCAGCTCCTCCCAGACCTCTCCAGAAGAAGCCATGGGAACCCCTCGTATCCAGCATTT 

GCTGATCCTCCTGGTCCTAGGAGCCTCCCTCCTGACCTCGGGCCTAGAGCTGTATTGTCAAAA 

GGGTCTGTCCATGACTGTGGAAGCAGATCCAGCCAATATGTTTAACTGGACCACAGAGGAAGT 

GGAGACTTGTGACAAAGGGGCACTTTGCCAGGAAACCATACTAATAATTAAAGCAGGGACTGA 

GACAGCCATTTTGGCCACGAAGGGCTGCATCCCGGAAGGGGAGGAGGCCATAACAATTGTCCA 

GCACTCTTCACCTCCCGGCCTGATCGTGAGCTCCTACAGTAACTACTGTGAGGATTCCTTCTG 

TAATGACAAAGACAGCCTCTCTCAGTTTTGGGAGTTCAGTGAGACCACAGCTTCCACTGTGTC 

AACAACCCTCCATTGTCCAACCTGTGTGGCTTTGGGGACCTGTTTCAGTGCTCCTTCTCTTCC 

CTGTCCCAATGGTACAACTCGATGCTATCAAGGAAAACTTGAGATCACTGGAGGTGGCATTGA 

GTCGTCTGTGGAGGTCAAAGGCTGTACAGCCATGATTGGCTGCAGGCTGATGTCTGGAATCTT 

AGCAGTAGGACCCATGTTTGTGAGGGAAGCGTGCCCACATCAGCTGCTCACTCAACCTCGAAA 

GACTGAAAATGGGGCCACCTGTCTTCCCATTCCTGTTTGGGGGTTACAGCTACTGCTGCCATT 

GCTGCTGCCATCATTTATTCACTTTTCCTAAGAAGGCACTTCTGGGCCTGGGTCTGAGGACAT 

CTTTTTTGACTGGGAGCCTTCTTACTGTTGAGGTTCAACAAGCTGAGGAGTAGATGGGAATTT 

GAGGGAGAATACAGAGATACTATGAACGTATTTGACATTTTTAATACAATTTCTGCTATAATT 

TTTGTATGCAGTAGGCGTTACTAATAAACATTTCTGCTGTGA 
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FIGURE 110 

MGTPRIQHLLILLVLGASLLTSGLELYCQKGLSMTVEADPANMFNWTTEEVETCDKGALCQET 
ILIIKAGTETAILATKGCIPEGEEAITIVQHSSPPGLIVTSYSNYCEDSFCNDKDSLSQFWEF
SETTASTVSTTLHCPTCVALGTCFSAPSLPCPNGTTRCYQGKLEITGGGIESSVEVKGCTAMI 
GCRLMSGILAVGPMFVREACPHQLLTQPRKTENGATCLPIPVWGLQLLLPLLLPSFIHFS 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 184-201 

N-glycosylation sites. 

amino acids 45-49, 159-163 



N-myristoylation sites. 

amino acids 31-37, 70-76, 99-105, 147-153, 160-166, 174-180, 
175-181 
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FIGURE 111 

CGAGAAGAGGACAGAGGAGACTGAGCAAAGGGGGGTGGGCTCCAGGCGACCCCTAGCCCAATTCTGCCCCTCCAT 

cccSgggcagagaaattgtctttctttgctgactcctac 

AGGGAAAGATAAACGGAGACGGAGGAAAGGTGGCAGCCAGATTACTTAGAGAGGCACAGAGGAGAGAGATCGGGG 
TGAGTCGCCATGGGGACTCCCAGGGCCCAGCACCCGCCGCCTCCCCAGCTGCTGTTCCTAATTCTGCTGAGCTGT 
CCCTGGATCCAGGGTCTGCCCCTGAAGGAGGAGGAGATATTGCCAGAGCCTGGAAGTGAGACCCCCACGGTGGCC 
TCTGAGGCCCTGGCTGAACTGCTTCATGGGGCCCTGCTGAGGAGGGGCCCAGAGATGGGCTACCTGCCAGGATCT 

gSccggaccccacgctagccacccctccggccggccagactctcgcagtgccctccctgccac 
SSSc^ggagaagg^ 

g^gSSSgggSS^Jcaatgc^ 

?gcaSgSSotgtggag^ggagc^ 

agcc^gSaagactgcgtgtg 

aatgtgcgggaaggggacatgctgacgctgttcgacggggacggtccc 
gcctgccaaaagatcatgacttgtgctgaccctggcgagattgc 

??ScCG^C?CCCACGTCCAG^ 

?acagccgggac!cS 

AACCCGGGGGTTCCCGAGAATGGCTACCAGACGCTGTACAAGCACCACTACCAGGCGGGCGAGTC 
^CTGCTATGAGGGCTTTGAGCTTATCGGCGAGGTCACCATCACCTGTGTGCCCGGCCACCCCTC 

Igccagcccccactctgcaa^g^ 

rcCATCCTGCTGCCTCTAGGCTTGGTCATTGTCCTCGGCAGTGGCGTTTACATCTACTACACCA^ 

^gaagSggggatacgcgggagtatgaagtttccatc^accccaagactacagctgcaggac 

CCCTCCCCTCCTCATTCGGGCAGAGGGAAATACGGGACCCGGTCTCTGCCTCCTGGCTGCCCTCCTC^ 

tgtaStagtctccctatcccacgagggggctttgatggccctggagatcctacagtaaataaaccagcatcctg 

CCGCCCAAAAAA 
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FIGURE 112 

MGTPRAQHPPPPQLLFLILLSCPWIQGLPLKEEEILPEPGSETPTVASEALAELLHGALLRRG 
PEMGYLPGSDPDPTLATPPAGQTLAVPSLPRATEPGTGPLTTAVTPNGVRGAGPTAPELLTPP 
PGTTAPPPPSPASPGPPLGPEGGEEETTTTIITTTTVTTTVTSPVLCNNNISEGEGYVESPDL 
GSPVSRTLGLLDCTYSIHVYPGYGIEIQVQTLNLSQEEELLVLAGGGSPGLAPRLLANSSMLG 
EGQVLRSPTNRLLLHFQSPRVPRGGGFRIHYQAYLLSCGFPPRPAHGDVSVTDLHPGGTATFH 
CDSGYQLQGEETLICLNGTRPSWNGETPSCMASCGGTIHNATLGRIVSPEPGGAVGPNLTCRW 
VIEAAEGRRLHLHFERVSLDEDNDRLMVRSGGSPLSPVIYDSDMDDVPERGLISDAQSLYVEL 
LSETPANPLLLSLRFEAFEEDRCFAPFLAHGNVTTTDPEYRPGALATFSCLPGYALEPPGPPN 
AIECVDPTEPHWNDTEPACKAMCGGELSEPAGVVLSPDWPQSYSPGQDCVWGVHVQEEKRILL 
QVEILNVREGDMLTLFDGDGPSARVLAQLRGPQPRRRLLSSGPDLTLQFQAPPGPPNPGLGQG 
FVLHFKEVPRNDTCPELPPPEWGWRTASHGDLIRGTVLTYQCEPGYELLGSDILTCQWDLSWS 
AAPPACQKIMTCADPGEIANGHRTASDAGFPVGSHVQYRCLPGYSLEGAAMLTCYSRDTGTPK 
WSDRVPKCALKYEPCLNPGVPENGYQTLYKHHYQAGESLRFFCYEGFELIGEVTITCVPGHPS 
QWTSQPPLCKVTQTTDPSRQLEGGNLALAILLPLGLVIVLGSGVYIYYTKLQGKSLFGFSGSH 

SYSPITVESDFSNPLYEAGDTREYEVSI 

Important features of the protein: 
Signal peptide: 

amino acids 1-27 
Transmembrane domain:
amino acids 842-864 
N-glycosylation sites. 

amino acids 176-180, 222-226, 247-251, 332-336, 355-359, 373-377, 

473-477, 517-521, 641-645 

Tyrosine kinase phosphorylation site. 

amino acids 61-69 

N-myristoylation sites.

amino acids 2-8, 84-90, 111-117, 114-120, 190-196, 198-204, 
235-241, 309-315, 333-339, 351-357, 472-478, 484-490, 528-534, 
626-632, 665-671, 775-781, 842-848 
Amidation site.
amino acids 384-388 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 12-23 

CUB domain proteins profile. 

amino acids 202-218, 376-392, 553-569 
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FIGURE 113 

^^PPP^^S^^p^^^^^^q^jGqACAAAATATTGGAATCAAGCATATTCCTGCAACCCAGTGTGGCAT 

??ggg^cgSccag^aXgaggtS^^ 
?a???aSSggSgctcc^ 
I?wgag?gSgg?wgatcacttggaagt 
Jggcgtgaaaagccctccattaattagatcaacagggagattcatgtgga 

TrftAGGACTG^^TTTCGAGCAAAM 

gScaaaaccaggccaagccgttgattgcatctggaccattaaagcc^^ 

?^5a^tatca^^gagcactcaaatgaatgcaagagaaacttcgttgcagtctatgatggaagcagt^ 
?aSgaaScc?gaaggccaagttttgcagcactgtggccw 

tpJaS^GGCAGATGAAGGTAGTCGGCTTAGCAGGTTTCGAATG 
CACMGCAGCACTTTCT^TGC^ATAG^ 

ScmacJSgggatgaaaatcattgtaaa^ 
SgaStggaggagmtccctgtg^ 

cattgacttc^aatcttctgctaatggtgatgtgaattcttagggtgtgtacgtacgcagcctccaggg^ 

arTrTTTCCAGCAGCCAACCCTTTTCTCCCATCACAACTACGAAGACCTTGATTTACCGTTAACCTATTGTATGG 

MTCATCTCTATTGCTTGGTGTCATTTATAGGAAGCACTGCCAGTTA 
^rCAGGCTCAGGCTGCCTCTTCGTTTTAGCAACAAGAAGACTGCTCTTGACTGATAACAG 

?gSgccSS?aSSgattt?S 

rrCTATCGT^TAC^ATTT^ 

t?SSgISg?gSaaIatacagctm^ 
gSmga^attacctttcaattatatttttctcagttatactagaaa^ 

CA^GTCAGCTATGACTAAAAATTTGAA^ 

SISca^gacS^ 
?t?SS?g??S?Stcgctagattc 

??ctagSctS?caag^aacaacttccacttt 
caataatttataaacataaamctcattgtgttttttagac 

g^gga^??gga^ctataacttattgtaaagaccttggaaat 

TATACATTGTACCTGAAAAAAAA 
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FIGURE 114

MALERLCSVLKVLLITVLVVEGIAVAQKTQDGQNIGIKHIPATQCGIWVRTSNGGHFASPNYP 
DSYPPNKECIYILEAAPRQRIELTFDEHYYIEPSFECRFDHLEVRDGPFGFSPLIDRYCGVKS 
PPLIRSTGRFMWIKFSSDEELEGLGFRAKYSFIPDPDFTYLGGILNPIPDCQFELSGADGIVR 
SSQVEQEEKTKPGQAVDCIWTIKATPKAKIYLRFLDYQMEHSNECKRNFVAVYDGSSSIENLK 
AKFCSTVANDVMLKTGIGVIRMWADEGSRLSRFRMLFTSFVEPPCTSSTFFCHSNMCINNSLV 
CNGVQNCAYPWDENHCKEKKKAGVFEQITKTHGTIIGITSGIVLVLLIISILVQVKQPRKKVM 
ACKTAFNKTGFQEVFDPPHYELFSLRDKEISADLADLSEELDNYQKMRRSSTASRCIHDHHCG 
SQASSVKQSRTNLSSMELPFRNDFAQPQPMKTFNSTFKKSSYTFKQGHECPEQALEDRVMEEI 

PCEIYVRGREDSAQASISIDF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain:

amino acids 348-369 

N-glycosylation sites.

amino acids 311-315, 385-389, 453-457, 475-479 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 426-430, 479-483 

N-myristoylation sites. 

amino acids 22-28, 32-38, 54-60, 186-192, 279-285, 318-324, 
348-354, 352-358, 441-447 
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FIGURE 115

===t=iii 



^ TTOT CAAC™CTCT ? ™ 

TTTCCAAATT 
AATATCAGT"] 
ATGAAAAAT7 
.GACCAGCCC] 
.TATCTGGAAi 

atactgtctt 
:gatatattac 
,gcatactta( 

iTGCAATAAAi 
?TCGGTTATTi 
JTGTTTCTGTC 
5ATTTTTAGC 
:CTTCAGAAG< 
DTTGGAATCC 
TGATTATAAC 
\AAGTGTGAG' 
TACAGAGTAC 
TTTGTGCAAG- 

^CTCAAACTACTGATAACCAGTGTGCCAAGATATTTGGAAAAGGTGCTCAAGGTGCTCCA 



TTGGTTTATACATATAATGAAACTGGATCTTTGCATTCTGTGTCTCCATATTTTATGAT^ 

\Ai 
\N] 
\G1 
\T1 
TA 1 ] 
TG 
^G( 
AT< 
TT' 
GTi 
GA 
CC 
TT 
.TG 
AA 

CCTGACACTTATGCACTGAATGGCCGT^GTGCAAGTTGG 



TGCCATTACCAAGGATATGCTGCCGAATTTCCAAATTCATTTGTGACACT^CAGTATATGTTCT 

— 3AAAAT7 

:aaatg; 

f^AAGACC 
3AATATC 
3TTATAC 
GATGATi 

GTATGCi 
TTTTCGi 
CAGTGT' 
AAGATT 
TGCCTT< 
IATTTTG 
1TGTGAT 

AAGAGTATTGATC^ 



GGTCTCAGGGGATTTCTCCAGTTTGAAAATATCAGTTATGGAATTGAACCAGTAGAATCTTCA 

t^AAAATCTTTCAAAACTATTACCCC. 
ATGTTTACCCAGTTCAAATTGACTG' 
3AGATTTCCACCAGTGGGGATGCTG 
TATCTCATCCTACGGCCCCATGACA 
GTGGGAGCAACATTTCCTGGCACCG 
CCAGATGCAATAGGTTTGGAGGGAT 
GGATTAACATATGATGACATCACTC 
GAAGCAGTGAGTGCCAGTGGTAGA? 
TTTGTTTCAAAATTTGAGACTAAAT 

TGTGGTTCTGGACCATGTTGTACATCAAAGTGTGAGTT 



GCAAGATTTGAGCATATAATTTATCAAATGAAAAATAATGATCCAAATGTM 

" ^ * " " jAAAGACCAGCCCTAC 

XAATATCTGGAAATj 

cgttatactgtcttcc 
tgatgatatattaca 
:atagcatacttact' 
:gtatgcaataaaag< 

&TTTTCGGTTATTAT 
TCAGTGTTTCTGTCTi 
AAAGATTTTTAGCAA 
&TGCCTTCAGAAGCT 
GATTTTGGAATCCAA 

GTCAATAGCAGGCACTCCATGTAGA 



CAGATTTCCACCAGTGGGGATGC 
TATCTCATCCTACGGCCCCATGA 
GTGGGAGCAACATTTCCTGGCAC 
CCAGATGCAATAGGTTTGGAGGG 
GGATTAACATATGATGACATCAC 



GTGGGAGCAACATTTCCTGGCACCGTATGC 



Sag^aIcSaaaggaacgaatgtgcttta^ 
aSg?aaa^?caaacttttggaaaataaagcctgcgtgccctccc 
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FIGURE 116

MFLLLALLTELGRLQAHEGSEGIFLHVTVPRKIKSNDSEVSERKMIYIITIDGQPYTLHLGKQ 

SFLPQNFLVYTYNETGSLHSVSPYFMMHCHYQGYAAEFPNSFVTLSICSGLRGFLQFENISYG 

IEPVESSARFEHIIYQMKNNDPNVSILAVNYSHIWQKDQPYKVPLNSQIKNLSKLLPQYLEIY 

IIVEKALMFTQFKLTVILSSLELWSNENQISTSGDADDILQRFLAWKRDYLILRPHDIAYLLV 

YRKHPKYVGATFPGTVCNKSYDAGIAMYPDAIGLEGFSVIIAQLLGLNVGLTYDDITQCFCLR 

ATCIMNHEAVSASGRKIFSNCSMHDYRYFVSKFETKCLQKLSNLQPLHQNQPVCGNGILESNE 

ECDCGNKNECQFKKCCDYNTCKLKGSVKCGSGPCCTSKCELSIAGTPCRKSIDPECDFTEYCN 

GTSSNCVPDTYALNGRLCKLGTAYCYNGQCQTTDNQCAKIFGKGAQGAPFACFKEVNSLHERS 

ENCGFKNSQPLPCERKDVLCGKLACVQPHKNANKSDAQSTVYSYIQDHVCVSIATGSSMRSDG 

TDNAYVADGTMCGPEMYCVNKTCRKVHLMGYNCNATTKCKGKGICNNFGNCQCFPGHRPPDCK 

FQFGSPGGSIDDGNFQKSGDFYTEKGYNTHWNNWFILSFCIFLPFFIVFTTVIFKRNEISKSC 

NRENAEYNRNSSVVSESDDVGH 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 
Transmembrane domain: 
amino acids 665-684 
N-glycosylation sites. 

amino acids 36-39, 76-79, 122-125, 149-152, 156-159, 177-180, 
270-273, 335-338, 441-444, 537-540, 587-590, 601-604, 703-706 
Casein kinase II phosphorylation sites. 

amino acids 74-77, 208-211, 221-224, 304-307, 337-340, 346-349, 
376-380, 415-418, 499-502, 639-642, 708-711 
Tyrosine kinase phosphorylation site.
amino acids 243-249 
N-myristoylation sites.

amino acids 53-58, 79-84, 266-271, 298-303, 372-377, 403-408, 
408-413, 442-447, 462-467, 469-474, 488-493, 567-572, 610-615, 
616-621, 634-639 
Amidation site.
amino acids 328-331 
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FIGURE 117 

CCCACGCGTCCGCGGACGCGTGGGGCTCAGTGGGCGTCGCGCGAAGGCTAAGGGAGTGTGGCG 

GGCGGCTCCGGGAGCCAACATGCCTCGGTATGCGCAGCTGGTCATGGGCCCCGCGGGCAGCGG

GAAGAGCACCTACTGTGCCACCATGGTCCAGCACTGTGAAGCCCTCAACCGGTCTGTCCAAGT 

TGTAAACCTGGATCCAGCAGCAGAACACTTCAACTACTCCGTGATGGCTGACATCCGGGAACT 

GATCGAGGTGGATGATGTAATGGAGGATGATTCTCTGCGATTCGGTCCCAACGGAGGATTGGT 

ATTTTGCATGGAGTACTTTGCCAATAATTTTGACTGGCTGGAGAACTGTCTTGGCCATGTAGA 

GGACGACTATATCCTTTTTGATTGTCCAGGTCAGATTGAGTTGTACACTCACCTGCCTGTGAT 

GAAACATCTGGTCCAGCAGCTCGAGCAGTGGGAGTTCCGAGTCTGTGGAGTTTTTCTTGTTGA 

TTCTCAGTTCATGGTGGAGTCATTCAAGTTTATTTCTGGCATCTTGGCAGCCCTGAGTGCCAT 

GATCTCTCTAGAAATTCCGCAAGTCAACATCATGACAAAAATGGATCTGCTGAGTAAAAAAGC 

AAAAAAGGAAATTGAGAAATTTTTAGATCCAGACATGTATTCTTTATTAGAAGATTCTACAAG 

TGACTTAAGAAGCAAAAAATTCAAGAAACTGACTAAAGCTATATGTGGACTGATTGATGACTA 

CAGCATGGTTCGATTTTTACCTTACGATCAGTCAGATGAAGAAAGCATGAACATTGTATTGCA 

GCATATTGATTTTGCCATTCAATATGGAGAAGACCTAGAATTTAAAGAACCAAAGGAACGTGA 

AGATGAGTCTTCCTCTATGTTTGACGAATATTTTCAAGAATGCCAGGATGAAT6AAGAGTTTA 

CTAAAAGTAACCATCTAAAGAGCTTGTGGCCAAACCAGCAGAACATTCTTCTCTTCAAAGGAT 

GCAATAGTAGAAAGCTACTTATTTTAATGAAAAAAAGTAAAACTTCGTTCTTTATCAGCCTCA 

TGCCTGAATCAAATTTTTAATTATTCTGAAACTGCTGCTGTTTAAAGTGGAATCTTTTAGTAT 

TATAACAGCATCACTTTAGATTTTGTAAGTCAAAATTGAAATGAATGCACATAGATTTATATA 

TAAATTAGCACCTGAGCTAAGGTTAAGGCCGGTCTAAACTTATTTTCACTTTTTGTATTATTT 

TTGAGATGCAGGAATTACTGTAACAAAATATGTATGTCCGAAGGGAAAAAGCTGCAAGGATAT 

ATATAAGACCACTGCTTATCTGTATCTTCCCATTTTCCTATATTGAAAATGTATATTATTTAT 

ATAACTTAAAAAGTAAAAATAACTATGTTTTGAGAT 
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FIGURE 118

MPRYAQLVMGPAGSGKSTYCATMVQHCEALNRSVQVVNLDPAAEHFNYSVMADIRELIEVDDV 
MEDDSLRFGPNGGLVFCMEYFANNFDWLENCLGHVEDDYILFDCPGQIELYTHLPVMKHLVQQ 
LEQWEFRVCGVFLVDSQFMVESFKFISGILAALSAMISLEIPQVNIMTKMDLLSKKAKKEIEK 
FLDPDMYSLLEDSTSDLRSKKFKKLTKAICGLIDDYSMVRFLPYDQSDEESMNIVLQHIDFAI 

QYGEDLEFKEPKEREDESSSMFDEYFQECQDE 

Important features of the protein: 
Signal peptide: 

amino acids 1-29 

Transmembrane domain:

amino acids 151-170 

N-glycosylation sites. 

amino acids 31-35, 47-51 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 212-216 

Tyrosine kinase phosphorylation site. 

amino acids 189-197 

N-myristoylation sites. 

amino acids 13-19, 76-82, 154-160 

ATP/GTP-binding site motif A (P-loop).

amino acids 10-18 
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FIGURE 119 

GGGCGCTGGGAGACACCGGACGCCCGCTCGGCTGCGCTGCGGCTCAGGCCCCCGCTCGGGCCC 
GACCCGCTCGGTCACCGCCGGCTCGGGCGCGCACCTGCCGGCTGCGGCCCCAGGGCCATGCGG 
AGGCCCACGAGGAGGCCGGCGGCCACGCGCATCCCGTAGCCCAGGTGGCCCAGGTCTGCACCG 
CGGCGGCCTCGGCGGCATGGAGCCCCCGTATTCGCTGACGGCGCACTACGATGAGTTCCAAGA 
GGTCAAGTACGTGAGCCGCTGCGGCGCGGGGGGCGCGCGCGGGGCCTCCCTGCCCCCGGGCTT 
CCCGTTGGGCGCTGCGCGCAGCGTCACCGGGGCCCGGTCCGGGCTGCCGCGCTGGAACCGGCG 
CGAGGTGTGCCTGCTGTCGGGGCTGGTGTTCGCCGCCGGCCTCTGCGCCATTCTGGCGGCTAT 
GCTGGCCCTCAAGTACCTGGGCCCGGTCGCGGCCGGCGGCGGCGCCTGTCCCGAGGGCTGCCC 
TGAGCGCAAGGCCTTCGCGCGCGCCGCTCGCTTCCTGGCCGCCAACCTGGACGCCAGCATCGA 
CCCATGCCAGGACTTCTACTCGTTCGCCTGCGGCGGTTGGCTGCGGCGCCACGCCATCCCCGA 
CGACAAGCTCACCTATGGCACCATCGCGGCCATCGGCGAGCAAAACGAGGAGCGCCTACGGCG 
CCTGCTGGCGCGGCCCGGGGGTGGGCCTGGCGGCGCGGCCCAGCGCAAGGTGCGCGCCTTCTT 
CCGCTCGTGCCTCGACATGCGCGAGATCGAGCGACTGGGCCCGCGACCCATGCTAGAGGTCAT 
CGAGGACTGCGGGGGCTGGGACCTGGGCGGCGCGGAGGAGCGTCCGGGGGTCGCGGCGCGATG 
GGACCTCAACCGGCTGCTGTACAAGGCGCAGGGCGTGTACAGCGCCGCCGCGCTCTTCTCGCT 
CACGGTCAGCCTGGACGACAGGAACTCCTCGCGCTACGTCATCCGCATTGACCAGGATGGGCT 
CACCCTGCCAGAGAGGACCCTGTACCTCGCTCAGGATGAGGACAGTGAGAAGATCCTGGCAGC 
ATACAGGGTGTTCATGGAGCGAGTGCTCAGCCTCCTGGGTGCAGACGCTGTGGAACAGAAGGC 
CCAAGAGATCCTGCAAGTGGAGCAGCAGCTGGCCAACATCACTGTGTCAGAGTATGACGACCT 
ACGGCGAGATGTCAGCTCCATGTACAACAAGGTGACGCTGGGGCAGCTGCAGAAGATCACCCC 
CCACTTGCGGTGGAAGTGGCTGCTAGACCAGATCTTCCAGGAGGACTTCTCAGAGGAAGAGGA 
GGTGGTGCTGCTGGCGACAGACTACATGCAGCAGGTGTCGCAGCTCATCCGCTCCACACCCCA 
CCGGGTCCTGCACAACTACCTGGTGTGGCGCGTGGTGGTGGTCCTGAGTGAACACCTGTCCCC 
GCCATTCCGTGAGGCACTGCACGAGCTGGCACAGGAGATGGAGGGCAGCGACAAGCCACAGGA 
GCTGGCCCGGGTCTGCTTGGGCCAGGCCAATCGCCACTTTGGCATGGCGCTTGGCGCCCTCTT 
TGTACATGAGCACTTCTCAGCCGCCAGCAAAGCCAAGGTGCAGCAGCTAGTGGAAGACATCAA 
GTACATCCTGGGCCAGCGCCTGGAGGAGCTGGACTGGATGGACGCCGAGACCAGGGCTGCTGC 
TCGGGCCAAGCTCCAGTACATGATGGTGATGGTCGGCTACCCGGACTTCCTGCTGAAACCCGA 
TGCTGTGGACAAGGAGTATGAGTTTGAGGTCCATGAGAAGACCTACTTCAAGAACATCTTGAA 
CAGCATCCCCTTCAGCATCCAGCTCTCAGTTAAGAAGATTCGGCAGGAGGTGGACAAGTCCAC 
GTGGCTGCTCCCCCCACAGGCGCTCAATGCCTACTATCTACCCAACAAGAACCAGATGGTGTT 
CCCCGCGGGCATCCTGCAGCCCACCCTGTACGACCCTGACTTCCCACAGTCTCTCAACTACGG 
GGGCATCGGCACCATCATTGGACATGAGCTGACCCACGGCTACGACGACTGGGGGGGCCAGTA 
TGACCGCTCAGGGAACCTGCTGCACTGGTGGACGGAGGCCTCCTACAGCCGCTTCCTGCGAAA 
GGCTGAGTGCATCGTCCGTCTCTATGACAACTTCACTGTCTACAACCAGCGGGTGAACGGGAA 
ACACACGCTTGGGGAGAACATCGCAGATATGGGCGTCCTCAAGCTGGCCTACCACGCCTATCA 
GAAGTGGGTGCGGGAGCACGGCCCAGAGCACCCACTTCCCCGGCTCAAGTACACACATGACCA 
GCTCTTCTTCATTGCCTTTGCCCAGAACTGGTGCATCAAGCGGCGGTCGCAGTCCATCTACCT 
GCAGGTGCTGACTGACAAGCATGCCCCTGAGCACTACAGGGTGCTGGGCAGTGTGTCCCAGTT 
TGAGGAGTTTGGCCGGGCTTTCCACTGTCCCAAGGACTCACCCATGAACCCTGCCCACAAGTG 
TTCCGTGTGGTGAGCCTGGCTGCCCGCCTGCACGCCCCCACTGCCCCCGCACGAATCACCTCC
TGCTGGCTACCGGGGCAGGCATGCACCCGGTGCCAGCCCCGCTCTGGGCACCACCTGCCTTCC 
AGCCCCTCCAGGACCCGGTCCCCCTGCTGCCCCTCACTTCAGGAGGGGCCTGGAGCAGGGTGA 
GGCTGGACTTTGGGGGGCTGTGAGGGAAATATACTGGGGTCCCCAGATTCTGCTCTAAGGGGG 
CCAGACCCTCTGCCAGGCTGGATTGTACGGGCCCCACCTTCGCTGTGTTCTTGCTGCAAAGTC 
TGGTCAATAAATCACTGCACTGTTAAAAAAAAA 
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FIGURE 120 

MEPPYSLTAHYDEFQEVKYVSRCGAGGARGASLPPGFPLGAARSVTGARSGLPRWNRREVCLL 
SGLVFAAGLCAILAAMLALKYLGPVAAGGGACPEGCPERKAFARAARFLAANLDASIDPCQDF 
YSFACGGWLRRHAIPDDKLTYGTIAAIGEQNEERLRRLLARPGGGPGGAAQRKVRAFFRSCLD 
MREIERLGPRPMLEVIEDCGGWDLGGAEERPGVAARWDLNRLLYKAQGVYSAAALFSLTVSLD 
DRNSSRYVIRIDQDGLTLPERTLYLAQDEDSEKILAAYRVFMERVLSLLGADAVEQKAQEILQ 
VEQQLANITVSEYDDLRRDVSSMYNKVTLGQLQKITPHLRWKWLLDQIFQEDFSEEEEVVLLA
TDYMQQVSQLIRSTPHRVLHNYLVWRVVVVLSEHLSPPFREALHELAQEMEGSDKPQELARVC
LGQANRHFGMALGALFVHEHFSAASKAKVQQLVEDIKYILGQRLEELDWMDAETRAAARAKLQ 
YMMVMVGYPDFLLKPDAVDKEYEFEVHEKTYFKNILNSIPFSIQLSVKKIRQEVDKSTWLLPP 
QALNAYYLPNKNQMVFPAGILQPTLYDPDFPQSLNYGGIGTIIGHELTHGYDDWGGQYDRSGN 
LLHWWTEASYSRFLRKAECIVRLYDNFTVYNQRVNGKHTLGENIADMGVLKLAYHAYQKWVRE 
HGPEHPLPRLKYTHDQLFFIAFAQNWCIKRRSQSIYLQVLTDKHAPEHYRVLGSVSQFEEFGR 

AFHCPKDSPMNPAHKCSVW 

Important features of the protein: 
Transmembrane domain: 

amino acids 64-88 
N-glycosylation sites. 

amino acids 255-259, 322-326, 656-660 

cAMP- and cGMP-dependent protein kinase phosphorylation site.
amino acids 722-726 
N-myristoylation site. 

amino acids 24-30, 26-32, 27-33, 40-46, 47-53, 65-71, 148-154, 

169-175, 170-176, 237-243, 450-456, 604-610, 607-613

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 85-96 

Prenyl group binding site. 

amino acids 772-777 

Neutral zinc metallopeptidases, zinc-binding region signature.

amino acids 609-619 

Neutral zinc metallopeptidases, zinc-binding region proteins. 

amino acids 609-619 
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FIGURE 121 

CGGACTGCCCGGACCGCGCGATGGAGTCGACCGGCAGCGTCGGGGAGGCCCCGGGCGGACCCC 

GGGTGCTGGTGGTGGGCGGCGGCATCGCGGGGCTGGGCGCGGCGCAGAGGCTCTGCGGCCACT 

CCGCCTTCCCGCACCTGCGGGTCCTGGAGGCCACGGCCCGCGCCGGGGGCCGCATCCGCTCGG 

AGCGCTGCTTCGGTGGCGTGGTGGAGGTGGGCGCGCACTGGATCCATGGGCCCTCCCGGGGTA 

ACCCCGTCTTCCAGCTGGCTGCTGAGTACGGGCTGCTGGGGGAGAAGGAGCTGTCCCAGGAGA 

ACCAGCTGGTGGAGACCGGGGGTCACGTGGGCCTGCCCTCCGTGAGCTACGCCAGCTCCGGGG 

CCAGCGTGAGCCTCCAGCTGGTGGCGGAGATGGCGACTCTGTTCTACGGCCTGATAGACCAGA 

CCCGGGAGTTCCTGCACGCTGCAGAGACCCCGGTGCCCAGCGTCGGGGAGTACCTCAAGAAGG 

AGATTGGCCAGCACGTGGCCGGCTGGACAGAGGATGAGGAGACCAGGAAGCTGAAGCTGGCCG 

TCCTGAACTCCTTCTTCAACCTGGAATGCTGTGTGAGCGGCACCCACAGCATGGACCTGGTGG 

CCCTGGCACCCTTTGGGGAGTATACCGTGCTGCCGGGGCTGGACTGCACCTTTTCTAAGGGCT 

ATCAAGGACTCACAAACTGCATGATGGCCGCCCTGCCGGAGGACACTGTAGTTTTTGAGAAGC 

CTGTGAAGACCATCCACTGGAACGGGTCCTTCCAGGAGGCAGCCTTTCCCGGGGAGACCTTTC 

CAGTGTCGGTAGAGTGTGAGGATGGAGACCGGTTCCCGGCGCACCATGTCATCGTCACCGTGC 

CCTTAGGTTTTCTTAGGGAACATTTGGACACCTTCTTTGACCCTCCCCTGCCGGCTGAGAAGG 

CAGAAGCAATCAGGAAGATAGGCTTTGGGACCAACAACAAAATCTTCCTGGAGTTTGAGGAGC 

CCTTCTGGGAGCCAGACTGCCAGCTGATCCAGCTGGTGTGGGAGGACACGTCGCCCCTGGAGG 

ATGCTGCCCCTGAGCTACAGGACGCCTGGTTCCGGAAGCTCATTGGCTTTGTGGTCCTGCCTG 

CCTTTGCGTCTGTCCACGTTCTCTGTGGGTTCATTGCCGGACTTGAGTCTGAGTTCATGGAGA 

CTCTGTCGGATGAAGAAGTACTTCTGTGTCTCACCCAAGTGCTCCGGAGAGTGACAGGAAACC 

CACGGCTCCCCGCGCCCAAGAGCGTCCTGCGGTCTCGCTGGCACAGCGCCCCGTACACTAGGG 

GGTCCTACAGCTACGTGGCCGTGGGCAGTACTGGGGGCGACCTGGACCTGCTGGCTCAGCCCC 

TCCCTGCAGACGGCGCCGGCGCCCAGCTCCAGATCCTGTTTGCGGGGGAAGCCACACATCGCA 

CGTTTTACTCCACGACGCACGGGGCTCTGCTGTCGGGATGGAGGGAGGCCGACCGCCTCCTCA 

GTCTGTGGGCCCCGCAGGTGCAGCAGCCCAGGCCGAGGCTCTAGCTGGGCCCAGCCTACTCTG 

TTCCACCCGTGTCGGGGGTAGGCTGGGACCGTCATTTCTTCTGACAGATTTCAGTCTGGCTTG 

AAATTTGGGGATGTTAATGAGGGTCCTCTGGTTTTTGGTAACCAGGGCCACCTTCTCAGTTCT 

TGTGTCTGTTATTGGAGTCTGGCCAGGGTTGACTTGAGCTGAGACACCAGATGCTCACGGAGA 

TGCTGGACACATAAAGCAAGTTACAGCCACAAAAAAAAAAAA 
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FIGURE 122 

MESTGSVGEAPGGPRVLVVGGGIAGLGAAQRLCGHSAFPHLRVLEATARAGGRIRSERCFGGV 
VEVGAHWIHGPSRGNPVFQLAAEYGLLGEKELSQENQLVETGGHVGLPSVSYASSGASVSLQL 
VAEMATLFYGLIDQTREFLHAAETPVPSVGEYLKKEIGQHVAGWTEDEETRKLKLAVLNSFFN 
LECCVSGTHSMDLVALAPFGEYTVLPGLDCTFSKGYQGLTNCMMAALPEDTVVFEKPVKTIHW 
NGSFQEAAFPGETFPVSVECEDGDRFPAHHVIVTVPLGFLREHLDTFFDPPLPAEKAEAIRKI 
GFGTNNKIFLEFEEPFWEPDCQLIQLVWEDTSPLEDAAPELQDAWFRKLIGFVVLPAFASVHV
LCGFIAGLESEFMETLSDEEVLLCLTQVLRRVTGNPRLPAPKSVLRSRWHSAPYTRGSYSYVA 
VGSTGGDLDLLAQPLPADGAGAQLQILFAGEATHRTFYSTTHGALLSGWREADRLLSLWAPQV 

QQPRPRL 

Signal peptide: 

amino acids 1-28 

Transmembrane domain:

amino acids 364-385 

N-glycosylation site.

amino acids 253-257 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 408-412 

N-myristoylation sites.

amino acids 20-26, 21-27, 25-31, 105-111, 119-125, 164-170, 
216-222, 227-233, 443-449, 484-490 

Aminooxidase Flavin containing amine oxidase: 

amino acids 23-497 
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FIGURE 123 

CGGACGCGTGGGGGAAGATGGATAAATAATTCTGTCACACGTGCCCTGGCCTCTGGAGCT 
GTCTAGGGAATCTTAGCATCTGGGACCAAGACACTTTACAGCAATCATCAC 

SgSgccgcgcgc^ 
cccg??g?cgg?Sga?S 

rrftrrrrCCCCTGGACTGGTGGTGAGGCGCCCTCGTGGCCGCGGGACTGGCCCCGGGGGGCCCCCTGGATCCCAb 

TATarTT^A^TTTTTTGTATATGTTTGCTAGCTCTAAAAAGGTCGAGATGCAATAACACT 
TrJrArrJ^GGctAAAATTTCAAGGTGAGCCTGGAGCATTGTGTGTGGTG^ 

^a^gSScacStg^acccag^ 

ATTTCCACTGACTAAGTGGAAAA^ 

GGAAMTGTTTGGT^T^^^CTTAAAATGTCTTCTTTGTATTTTTTAATGTTCftflTRATGTATftTGTfiTCAG 
TTCTGTAATAAAGGGGAAAACACTTTTCA 
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FIGURE 124

MVDLSVSPDSLKPVSLTSSLVFLMHLLLLQPGEPSSEVKVLGPEYPILALVGEEVEFPCHLWP 
QLDAQQMEIRWFRSQTFNWHLYQEQQELPGRQMPAFRNRTKLVKDDIAYGSWLQLHSIIPS 
DKGTYGCRFHSDNFSGEALWELEVAGLGSDPHLSLEGFKEGGIQLRLRSSGWYPKPKVQWRDH 
QGQCLPPEFEAIVWDAQDLFSLETSVWRAGALSNVSVSIQNLLLSQKKELVVQIADVFVPGA 
SAWKSAFVATLPLLLVLAALALGVLRKQRRSREKLRKQAEKRQEKLTAELEKLQTELDWRRAE 
GQAEWRAAQKYAVDVTLDPASAHPSLEVSEDGKSVSSRGAPPGPAPGHPQRFSEQTCALSLER 
FSAGRHYWEVHVGRRSRWFLGACLAAVPRAGPARLSPAAGYWVLGLWNGCEYFVLAPHRVALT 
LRVPPRRLGVFLDYEAGELSFFNVSDGSHIFTFHDTFSGALCAYFRPRAHDGGEHPDPLTICP 

LPVRGTGVPEENDSDTWLQPYEPADPALDWW 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain:

amino acids 247-272 

N-glycosylation sites. 

amino acids 102-106, 139-143, 224-228, 464-468, 516-520 

Tyrosine kinase phosphorylation site. 

amino acids 105-114 

N-myristoylation sites. 

amino acids 129-135, 220-226, 399-405, 423-429, 480-486 



Amidation site. 

amino acids 390-394 
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FIGURE 125 

TATAGTCCCAGCTACTCATGGGGCTGATGCAGGTTGAGGCAGGAGGTTCATGAGCCCAGGAGGTTGGAGCTGTAA 

TGAGCTAGGATTCTGCCTCTGCACTCCTAGCTGGATGACAGAGCAAGACCCTGTCTCAAAAAAGAAAAAAAAAAA 

AAAAAGAATGCATGAACCAGACATGACAGTTCCTGGCCTCAAAGATCTTCCAAAGGAAATGATTTTTTTTTAAC^ 

ACCAATGCTGCAGGAAAAAGCAACATATTTAAGTTATCCAATAACACCTATCCAATAATTGTAAATCATTATCAT 

GACATGGTAGAGTTGTTTATATTTCTTTTCCTTTTAGGTGAAACACCATTCAAAGTCGTAGTCAAATCTCTTTCA 

CCTAAAGAGTTGGTCCGGATACATGTCCCTAAACCTTTGGACAGGAATGATGGAACATTTTTGATGAGATATAGG

ATGTATGAAACTGTCGATGAAGGCCTGAAGATAGAGGTCCTTTATGGTGATGAACATGTGGCTCAGTCTCCCTAT 

ATTTTGAAAGGACCAGTGTACCATGAGTACTGTGAGTGTCCGGAAGATCCTCAGGCCTGGCAGAAGACTCTTTCT 

TGTCCAACCAAGGAACCACAGATTGCAAAAGATTTTGCTTCCTTTCCCAGCATCAATCTCCAGC^ 

GAAGTCCCCAAAAGGTTTGGGGATGAGAGAGGTGCCATTGTTCATTACACGATTCTCAATAACCATGTTTACCGG 

AGtTCTTTAGGGAAATACACAGACTTCAAGATGTTCTCTGATGAGATTTTGTTATCATTGACAAGAAAGG 

CTCCCAGATTTAGAATTTTATGTTAATCTTGGAGATTGGCCCTTGGAGCATCGAAAAGTCAATGGAACCCCTAGC 

CCCATACCTATCATTTCATGGTGTGGCTCTCTGGATTCAAGAGATGTTGTCCTTCCAACGTATGACATCACCCAC 

TCCATGCTTGAAGCCATGCGGGGTGTTACAAATGATCTCCTCTCTATTCAGGGAAATACAGGGCCTTCCT 

AATAAAACAGAGAGAGCTTTCTTCAGAGGTAGAGACAGCCGAGAGGAGAGGCTCCAGTTGGTACAGCTGTCCAAA 

GAAAATCCTCAGCTACTAGATGCAGGAATTACAGGATATTTCTTTTTCCAAGAGAAAGAAAAGGAGCTTGGAAAA 

GCCAAGTTGATGGGTTTCTTTGATTTCTTTAAGTACAAGTATCAAGTAAATGTGGATGGGACCGTGGCTGCTTAC 

AGATATCCATATCTCATGCTGGGCGACAGTCTGGTTTTAAAGCAGGACTCGCCATATTATGAACATTTCTACATG 

AGGCTT^CTGCTACTATTACCAAGTACTGCAGAAATATGCCGAGCGCCA 

GGAATGGAACTTGTTCCTCAGCCAGAAGATAGCACAGCCATCTGCCAGTGCCACAGGAAAAAGCCTTCAAGAGAA 
GAACTTTGAGTCAGCCCAGAATCACACTCCTGTGTATCCCGGCTACACTTTAAGGAAAGATTGAATCTAAGCTGT 
GAAGGACIGTATAGAAGACTGCACCAAGTGGACTAGTTCTCCCGGTGGCTTTATATATGTAGATGGATATAGCAG 
TACTGGTTGAGTATCCCTCATCTGAAATGCTTAGGACCAGGAGTGTTTCAGGCTTCAGATTTTTTAAGATTTGGG 

AATATTTGCATGTACATAATGAGGTATCTTGGGGATGAGAT^ 

ATATACCTTGTTCACATACCCTGAAGGTAATTTTATATAATATTTTTAATAATTTGTGCATGAAACAAAGTTTGT 

ataSttgaactgtcagaaagcaaaggtgtcactatcttagcgacc 

TTGGATTTTGGGGTATTTCAGATTTTAGATTTTTGTATGAGGAATGTTCAACCTGTATTTGAACAAGCATTACCA 
AATATCATTGAATATTAATATCTTTTGCGTAAAAACTGCTATTATCAGCATCATAGTTTCTCTAAAAAGAAAACT 
^GGGATCATAGCCGATAGAGAGACTTGCTAAAATATAAATCAGCCTCTGCAAAACTGTTTACATATTTATTGGT 

^acaStttattggtttatttctatcccctgttcactttttctcttc 

TTGTTCAGGGTTGTCCCCCCGCCCCCCGTCACTGCATAATTTCTCCTCTTACAAGCTGCTTTTGGCTTTCATTAA 
TAACAGCTTCCTTTTAGAAGGTCTGATAAGGATATTTAAGGAAGAAGAGAATGACTCTGTTATTAAAGGTGGCAT 

GGAGACTGTGGAGGGAATATTTTTTAAAGCACTACTCATATCCTTTA^ 

CATTAAGGAGAAATTGTACCTTAAGTTAGTAATTCCAAATCTATCTGAGTTGTATACCCATCAAAGACAATACAG 

TTATTMCATAGATGAAGGTATGCTATAGGCATCATTCATTATCTCTATATTGAATAGGTGAAAGATAACT^ 

TCAGGTGAAAGGCATTCATCATTTTTAAGCTGAAAAGGGGATCCTTGAAAACACTGAAAACCTCTACAACAATCT 

TCAGGAAGCCTGCTATCTTGGGATTCACTAATAATAGGCCAAGAACAAAGGCAAGCATCCATTCCTCACTCCACC 

ACTTTTCTATTTCAGTGGGTGTCATTGCTACGATGAAGACTTTGGAAAT 

GGATTTAGGACTCATAGCCTGAAAGCTCATTACATACTCCTTGTAACCATCAGTCCAAGGTTCAGTTCACTAAAG 

TGCATGTTCTAAAACAAGAGCTATCCTCATTCCAAATTTTAAAATATGTACTCTGGCCGGTTGCAGTGGCTCACG 

CCTGTAATCCCAGCACTTTGGCAGGCCGAGATGGGCGGATCTTTTGAGGTCAGGAGTTTGAGACCAGCCTGGCCA 

ACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCAGGCATGGTGGCATTTGCCTGTAATCCCAGCT 

ACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGCAGAGGTTGCAGTGAGCTGAGATTACACCACT^ 

CACTCCAGCCTGGGTGACAGAGTGAGACTCCATCTCAAAAACTGAAAATAAAAATAAAAATATCT 

CTGAAATATTTACTTAATCTGGAAAACAATGTAACTATTTTTAAAGTGGTTACATCTATTCTTGCTGAAGAACAA 

TAAACAGAATTTTTTGACTAAGCATAACCAAATTTCAGAACAGTCTAATCAATGCCAAGTATCCAAGGCAAACTC 

TAATACCCATCCATTGTGCAAAACCACAAGCACGCAAGTATTAAATAAGAGCAAGCTGTCCTGAGCCCATACCTA 

ATGAATTTGTGTCTTAAATATTGTACATTGTGTTTGAGGCTTGTCAAAACTGGGATTATGGCAAGAAAGGTTGCC 

TAACTCATACCTTTCTGCCTCAAATTCCAGGTGCTAAAGGCTAATGGCATTTTAAACATCTTACATTTTTAAAAA 

TTTATATTGCCTCTGCCAAACAGGCCTAATAGTTAAAAGCAAGTTGAGACAAACCAGGCAGATTCAGTGTGTGGA 

ACAGGAAGGATGTGCTTTAAAAAAAGGTGGAATCCCTCAAAAAATTCTATAGGGAGACAGCAGCCTTAATCTACA 

TAATTCTTCATCTCGCCAATTCAGCCGCAGCCTTTAAAGAGTTAGTGTTAATGGCTTTCTGGTTTGAAAACAAAA 

ATGCATCTATGTGGTTGAAAGTTTGGGAGGAGATTCACCAATATCTGAGGAGAAGATGGAGTGAAGGGAATTCTT 

ACTTTTTGCTTTATACCTTTCTATAATATTTAGATTTTTTTTTACTGTAAGTATGGATCAAATTGCAAAATAAAG 

aIStSc^ccttagaaaagac^ 

CATTTTGCTCTTTTTAAATCTATGTTTAGAACTTTATATCTTGGGACTTATGTATATATATACCTTTTAAATAAA 
ATAAATTTTCTAAATAAAAAGTTG
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FIGURE 126 

MVELFIFLFLLGETPFKVVVKSLSPKELVRIHVPKPLDRNDGTFLMRYRMYETVDEGLKIEVL
YGDEHVAQSPYILKGPVYHEYCECPEDPQAWQKTLSCPTKEPQIAKDFASFPSINLQQMLKEV 
PKRFGDERGAIVHYTILNNHVYRRSLGKYTDFKMFSDEILLSLTRKVLLPDLEFYVNLGDWPL 
EHRKVNGTPSPIPIISWCGSLDSRDVVLPTYDITHSMLEAMRGVTNDLLSIQGNTGPSWINKT

ERAFFRGRDSREERLQLVQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQV
NVDGTVAAYRYPYLMLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVKWAKEND 
EEAKKIAKEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKPEVRDGMELVPQPEDSTAICQC 

HRKKPSREEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 250-254, 363-367 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 444-448 

N-myristoylation site. 

amino acids 208-214, 319-325, 388-394 

Endoplasmic reticulum targeting sequence. 

amino acids 448-453

Mitochondrial energy transfer proteins signature. 

amino acids 25-34 
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FIGURE 127 

AGCCGTCGGAGGGAGCCGGAGCGCTTCTCCCGAGTTGGTGATAGATTGGTGGTCATCCAACAT 

GCAGAAATGAATGAGCAGTGAAAAGCAGCAGAGCCGATGGGTCATGAGGATGTAAGTGCGTTT 

GAAGGCTTCCACACCCTCTACTCCAGGAATCATGAATAAACTGGAGGATAAGCAGGACCAGAT 

GATACCATGAAGAGAAGTTTACAGGCCCTCTATTGCCAACTGTTAAGTTTCCTGCTGATCTTG

GCACTGACCGAAGCGCTGGCATTTGCCATCCAGGAACCATCTCCCAGGGAATCTCTTCAGGTC 

CTCCCTTCAGGCACTCCCCCGGGAACCATGGTGACAGCACCCCACAGCTCTACCAGACATACT 

TCTGTGGTGATGCTGACCCCCAATCCCGATGGACCCCCCTCACAGGCTGCAGCTCCCATGGCA 

ACACTGACACCCCGTGCAGAGGGGCACCCTCCTACGCACACCATCTCCACCATCGCTGCGACA 

GTAACCGCCCCCTATTCTGAAAGCTCCCTGTCCACAGGGCCCGCTCCAGCAGCCATGGCAACC 

ACATCCTCCAAGCCAGAGGGCCGCCCTCGAGGGCAGGCTGCCCCCACCATCCTGCTGACAAAG 

CCACCGGGGGCCACCAGCCGCCCCACCACAGCGCCCCCCCGCACTACCACACGCAGGCCCCCC 

AGGCCCCCAGGCTCTTCCCGAAAAGGGGCTGGTAATTCATCACGCCCTGTCCCGCCTGCACCT 

GGTGGCCACTCCAGGAGTAAAGAAGGACAGCGAGGACGAAATCCAAGCTCCACACCTCTGGGG 

CAGAAGCGGCCCCTGGGGAAAATCTTTCAGATCTACAAGGGCAACTTCACAGGGTCTGTGGAA 

CCAGAGCCCTCTACCCTCACCCCCAGGACCCCACTCTGGGGCTACTCCTCTTCACCACAGCCC 

CAGACAGTGGCTGCGACCACAGTGCCCAGCAATACCTCATGGGCACCCACCACCACCTCCCTG 

GGGCCTGCAAAGGACAAGCCAGGCCTTCGCAGAGCAGCCCAGGGGGGTGGTTCTACCTTCACC 

AGCCAAGGAGGGACACCAGATGCCACAGCAGCCTCAGGTGCCCCTGTCAGTCCACAAGCTGCC 

CCAGTGCCTTCTCAGCGCCCCCACCACGGTGACCCACAGGATGGCCCCAGCCATAGTGACTCT 

TGGCTTACTGTTACCCCTGGCACCAGCAGACCTCTGTCTACCAGCTCTGGGGTCTTCACGGCT 

GCCACGGGGCCCACCCCAGCTGCCTTCGATACCAGTGTCTCAGCCCCTTCCCAGGGGATTCCT 

CAGGGAGCATCCACAACCCCACAAGCTCCAACCCATCCCTCCAGGGTCTCAGAAAGCACTATT 

TCTGGAGCCAAGGAGGAGACTGTGGCCACCCTCACCATGACCGACCGGGTGCCCAGTCCTCTC 

TCCACAGTGGTATCCACAGCCACAGGCAATTTCCTCAACCGCCTGGTCCCCGCCGGGACCTGG 

AAGCCTGGGACAGCAGGGAACATCTCCCATGTGGCCGAGGGGGACAAACCGCAGCACAGAGCC 

ACCATCTGCCTGAGCAAGATGGATATCGCCTGGGTGATCCTGGCCATCAGCGTGCCCATCTCC 

TCCTGCTCTGTCCTGCTGACGGTGTGCTGCATGAAGAGGAAGAAGAAGACCGCCAACCCGGAG 

AACAACCTGAGCTACTGGAACAACACCATCACCATGGACTACTTCAACAGGCATGCTGTGGAG 

CTGCCCAGGGAGATCCAGTCCCTTGAAACCTCTGAGGACCAGCTCTCAGAGCCCCGCTCCCCA 

GCCAATGGGGACTATAGAGACACTGGGATGGTCCTTGTTAACCCCTTCTGTCAAGAAACACTG 

TTTGTGGGAAACGATCAAGTATCTGAGATCTAACTACAGCAGGCATCACTTTGCCATTCCGTA 

TTTTTCGTCTCTAAATTATAAATATACAAATATATATATTATAAATATAACCTTGTGTAACCC 

TGACTTAATGAGAAACATTTTCAGCTTTTTTTCCTATGAATTGTCAACATCTTTTTTACAAGT 

GTGGTTTAAAAAAAAAAAAACTTTACAGAATGATCTGTGGCTTTATAAAATAAAGGTATTTCT 

AAGCAAAAAAAAAAAAAAAAA 
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FIGURE 128 

MKRSLQALYCQLLSFLLILALTEALAFAIQEPSPRESLQVLPSGTPPGTMVTAPHSSTRHTSV 
VMLTPNPDGPPSQAAAPMATLTPRAEGHPPTHTISTIAATVTAPYSESSLSTGPAPAAMATTS 
SKPEGRPRGQAAPTILLTKPPGATSRPTTAPPRTTTRRPPRPPGSSRKGAGNSSRPVPPAPGG 
HSRSKEGQRGRNPSSTPLGQKRPLGKIFQIYKGNFTGSVEPEPSTLTPRTPLWGYSSSPQPQT 
VAATTVPSNTSWAPTTTSLGPAKDKPGLRRAAQGGGSTFTSQGGTPDATAASGAPVSPQAAPV 
PSQRPHHGDPQDGPSHSDSWLTVTPGTSRPLSTSSGVFTAATGPTPAAFDTSVSAPSQGIPQG 
ASTTPQAPTHPSRVSESTISGAKEETVATLTMTDRVPSPLSTVVSTATGNFLNRLVPAGTWKP
GTAGNISHVAEGDKPQHRATICLSKMDIAWVILAISVPISSCSVLLTVCCMKRKKKTANPENN 
LSYWNNTITMDYFNRHAVELPREIQSLETSEDQLSEPRSPANGDYRDTGMVLVNPFCQETLFV 

GNDQVSEI 



Important features of the protein: 
Signal peptide: 

amino acids 1-28 



Transmembrane domain:

amino acids 469-487 

N-glycosylation sites.

amino acids 178-182, 223-227, 261-265, 446-450, 504-508, 509-513 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 495-499 

N-myristoylation sites. 

amino acids 44-50, 48-54, 175-181, 222-228, 279-285, 286-292,
288-294, 296-302, 351-357, 374-380, 427-433, 442-448 



TonB-dependent receptor proteins signature 1. 

amino acids 1-44 
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FIGURE 129 

AGGCGAGGCGCGGCGCCGCTGCACACACGCACACGGAGCTATGGGGTGCCATGTTGCCACCAG 

CTGCCACGTGGCCTGGCTTTTGGTGCTGATCTCTGGATGCTGGGGCCAGGTGAACCGGCTGCC 

CTTCTTCACCAACCACTTCTTTGATACATACCTGCTGATCAGCGAGGACACGCCTGTGGGTTC 

TTCTGTGACCCAGTTGCTGGCCCAAGACATGGACAATGACCCCCTGGTGTTTGGCGTGTCTGG 

GGAGGAGGCCTCTCGCTTCTTTGCAGTGGAGCCTGACACTGGCGTGGTGTGGCTCCGGCAGCC 

ACTGGACAGAGAGACCAAGTCAGAGTTCACCGTGGAGTTCTCTGTCAGCGACCACCAGGGGGT 

GATCACACGGAAGGTGAACATCCAGGTCGGGGATGTGAATGACAACGCGCCCACATTTCACAA 

TCAGCCCTACAGCGTCCGCATCCCTGAGAATACACCAGTGGGGACGCCCATCTTCATCGTGAA 

TGCCACAGACCCCGACTTGGGGGCAGGGGGCAGCGTCCTCTACTCCTTCCAGCCCCCCTCCCA 

ATTCTTCGCCATTGACAGCGCCCGCGGTATCGTCACAGTGATCCGGGAGCTGGACTACGAGAC 

CACACAGGCCTACCAGCTCACGGTCAACGCCACAGATCAAGACAAGACCAGGCCTCTGTCCAC 

CCTGGCCAACTTGGCCATCATCATCACAGATGTCCAGGACATGGACCCCATCTTCATCAACCT 

GCCTTACAGCACCAACATCTACGAGCATTCTCCTCCGGGCACGACGGTGCGCATCATCACCGC 

CATAGACCAGGATAAAGGACGTCCCCGGGGCATTGGCTACACCATCGTTTCAGGGAATACCAA 

CAGCATCTTTGCCCTGGACTACATCAGCGGAGTGCTGACCTTGAATGGCCTGCTGGACCGGGA 

GAACCCCCTGTACAGCCATGGCTTCATCCTGACTGTGAAGGGCACGGAGCTGAACGATGACCG 

CACCCCATCTGACGCTACAGTCACCACGACCTTCAATATCCTGGTTATTGACATCAATGACAA 

TGCCCCGGAGTTCAACAGCTCCGAGTACAGCGTGGCCATCACTGAGCTGGCACAGGTCGGCTT 

TGCCCTTCCACTCTTCATCCAGGTGGTGGACAAGGATGAGAATTTGGGCCTGAACAGCATGTT 

TGAGGTGTACTTGGTGGGGAACAACTCCCACCACTTCATCATCTCCCCGACCTCCGTCCAGGG 

GAAGGCGGACATTCGTATTCGGGTGGCCATCCCACTGGACTACGAGACCGTGGACCGCTACGA 

CTTTGATCTCTTTGCCAATGAGAGTGTGCCTGACCATGTGGGCTATGCCAAGGTGAAGATCAC 

TCTCATCAATGAAAATGACAACCGGCCCATCTTCAGCCAGCCACTGTACAACATCAGCCTGTA 

CGAGAACGTCACCGTGGGGACCTCTGTGCTGACAGTCCTGGTGAGTCCCCGCTTCACTGCAGG 

GCCACTGAGCTCTCCAGGGCCGACTGTGGTGAGGCACCCAGAGGGATTTTGTCCAAGGGACCT 

CAGCAATCAGGGAAGGAGGCACCCCCAAATCCCTGAGCTGTGTTTGTTGGTGTATTAAATAAA 

GTTTTTGGACTCTTCAGGAAGGGGCTCCCTTGACCTAGGTTGCAATATGGAAAAGGAGCCAAC 

CTGAGGGGTGACGAGACTGAGCTGAGGACACTGGTTTTCTGCCTTTCCCTGAGAGAGACTCAG 

TGAGGGTGGGCTGGGAGCCCTGGAAGCCCCCTCAAATGGGTGGGAAGGTGCCAGCCATCCTTG 

AGAAGGGCAACCCTCTCCATGTGAGCACAGGCACCAGAGAGGGGCAGGCGCCTGGAGGGTACC 

GGGGCACCCCCAGCTGCCCATGGCTGGACTTGCCCTTTGACAAGGGGCCCTCCCAGTGTCATT 

TGTATCTGTCAGTACTCTTGGTTGCAAGGGACAGAAACCCTTAAGTAGTTCAAGCAAAAAAGG 

ATTGGCTCATGTAACTCAAAAGTATAAGTGATTTCAGGCCGGGCTCGGTGGCTCACGCCTGTC 

ATCCAACACCTTGAGAAAGCCGAGGTGGGCGGATCACTTGAGGTCGGGAGTTTGAGACCAGCC 

TGGCCAACATGGCAAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCGGGTGTGGTGGCAC 

ACGCCTGTAGTCCCAGCTACTAGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCGG 

AGGTTGCAGTGAGCCGAGATTGTGTCACTGCCCTCCAGCCTGGGCGACAGAGCCAGATTCTGT 

CTC 
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FIGURE 130 

MGCHVATSCHVAWLLVLISGCWGQVNRLPFFTNHFFDTYLLISEDTPVGSSVTQLLAQDMDND 
PLVFGVSGEEASRFFAVEPDTGVVWLRQPLDRETKSEFTVEFSVSDHQGVITRKVNIQVGDVN
DNAPTFHNQPYSVRIPENTPVGTPIFIVNATDPDLGAGGSVLYSFQPPSQFFAIDSARGIVTV 
IRELDYETTQAYQLTVNATDQDKTRPLSTLANLAIIITDVQDMDPIFINLPYSTNIYEHSPPG 
TTVRIITAIDQDKGRPRGIGYTIVSGNTNSIFALDYISGVLTLNGLLDRENPLYSHGFILTVK 
GTELNDDRTPSDATVTTTFNILVIDINDNAPEFNSSEYSVAITELAQVGFALPLFIQVVDKDE 
NLGLNSMFEVYLVGNNSHHFIISPTSVQGKADIRIRVAIPLDYETVDRYDFDLFANESVPDHV 

GYAKVKITLINENDNRPIFSQPLYNISLYENVTVGTSVLTVLVSPRFTAGPLSSPGPTVVRHP
EGFCPRDLSNQGRRHPQIPELCLLVY

Important features of the protein: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain:
amino acids 355-374 
N-glycosylation sites. 

amino acids 155-159, 206-210, 349-353, 393-397, 434-438, 466-470, 
472-476 

N-myristoylation sites. 

amino acids 2-8, 49-55, 162-168, 270-276, 278-284, 316-322 
Amidation site.
amino acids 515-519 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 11-22 
Leucine zipper pattern. 

amino acids 298-320 

PTS HPR component serine phosphorylation site signature. 

amino acids 377-393 

Cadherins extracellular repeated domain signature. 

amino acids 120-131, 336-347 
Cadherins extracellular 

amino acids 120-144, 336-360 
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FIGURE 131

GTGGGCCGCCCCTGCTGCTGCCGTCCATGCTGATGTTTGCGGTGATCGTGGCCTCCAGCGGGC

TGCTGCTCATGATCGAGCGGGGCATCCTGGCCGAGATGAAGCCCCTGCCCCTGCACCCGCCCG 

GCCGCGAGGGCACAGCCTGGCGCGGGAAAGCCCCCAAGCCTGGGGGCCTGTCCCTCAGGGCTG 

GGGACGCGGACTTGCAAGTGCGGCAGGACGTCCGGAACAGGACCCTGCGGGCGGTGTGCGGAC 

AGCCAGGCATGCCCCGGGACCCCTGGGACTTGCCGGTGGGGCAGCGGCGCACCCTGCTGCGCC 

ACATCCTCGTAAGTGACCGTTACCGCTTCCTCTACTGCTACGTCCCCAAGGTGGCCTGCTCTA 

ACTGGAAGCGGGTGATGAAGGTGCTGGCAGGCGTCCTGGACAGCGTGGACGTCCGCCTCAAGA 

TGGACCACCGCAGTGACCTGGTGTTCCTGGCCGACCTGCGGCCTGAGGAGATTCGCTACCGCC 

TGCAGCACTACTTTAAGTTCCTGTTTGTGCGGGAGCCCTTGGAACGCCTCCTCTCTGCCTACC 

GCAACAAGTTTGGCGAGATCCGAGAGTACCAGCAACGCTATGGGGCTGAGATAGTGAGGCGGT 

ACAGGGCTGGAGCGGGGCCCAGCCCTGCAGGCGACGATGTCACATTCCCCGAGTTCCTGAGAT 

ACCTGGTGGATGAGGACCCTGAGCGCATGAATGAGCATTGGATGCCCGTGTACCACCTGTGCC 

AGCCTTGTGCCGTGCACTATGACTTTGTGGGCTCCTATGAGAGGCTGGAGGCTGATGCAAATC 

AGGTGCTGGAGTGGGTACGGGCACCACCTCACGTCCGATTTCCAGCTCGCCAGGCCTGGTACC 

GGCCAGCCAGCCCCGAAAGCCTGCATTACCACTTGTGCAGTGCCCCCCGGGCCCTGCTGCAGG 

ATGTGCTGCCTAAGTATATCCTGGACTTCTCCCTCTTTGCCTACCCACTGCCTAATGTCACCA 

AGGAGGCGTGTCAGCAGTGACCATGGGTGTGGGGCCAGCAGCTGGTGGGGACTGGTTTCAACG 

CCAGCTTTCTGTGCTTCTGCCTGTCATTCGGAGAAACTCTGGCTCTGGGGCTTGGGGCTTCTC 

AGGATCCTGGATGGCAGAGACTGCCCTCAGAAGTTCCTTGTCCAGGGTGGGCACCCACAGTGA 

CTCAGAGGACAGGGCTAGGCAGGAGACCTGCTGCTCCTCATTGGGGGGATCTCTTGGGGGGCA 

GACACCAGTTTGCCAATGAAGCAACACATCTGATCTAAAGACTGGCTCCAGACCCCGGGCTGC 

CAGGATTATGCAGTCCACTTGGTCTACCTTAATTTAACCTGTGGCCAAACTCAGAGATGGTAC 

CAGCCAGGGGCAAGCATGACCAGAGCCAGGGACCCTGTGGCTCTGATCCCCCATTTATCCACC 

CCATGTGCCTCAGGACTAGAGTGAGCAATCATACCTTATAAATGACTTTTGTGCCTTTCTGCT 

CCAGTCTCAAAATTTCCTACACCTGCCAGTTCTTTACATTTTTCCAAGGAAAGGAAAACGGAA 

GCAGGGTTCTTGCCTGGTAGCTCCAGGACCCAGCTCTGCAGGCACCCAAAGACCCTCTGTGCC 

CAGCCTCTTCCTTGAGTTCTCGGAACCTCCTCCCTAATTCTCCCTTCCTTCCCCACAAGGCCT 

TTGAGGTTGTGACTGTGGCTGGTATATCTGGCTGCCATTTTTCTGATGCATTTATTTAAAATT 

TGTACTTTTTGATAGAACCCTTGTAAGGGCTTTGTTTTCCTAATAGCTGACTTTTTAATAAAG 



CAGTTTTATATAT 
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FIGURE 132 

MLMFAVIVASSGLLLMIERGILAEMKPLPLHPPGREGTAWRGKAPKPGGLSLRAGDADLQVRQ 
DVRNRTLRAVCGQPGMPRDPWDLPVGQRRTLLRHILVSDRYRFLYCYVPKVACSNWKRVMKVL 
AGVLDSVDVRLKMDHRSDLVFLADLRPEEIRYRLQHYFKFLFVREPLERLLSAYRNKFGEIRE 
YQQRYGAEIVRRYRAGAGPSPAGDDVTFPEFLRYLVDEDPERMNEHWMPVYHLCQPCAVHYDF 
VGSYERLEADANQVLEWVRAPPHVRFPARQAWYRPASPESLHYHLCSAPRALLQDVLPKYILD 

FSLFAYPLPNVTKEACQQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

N-glycosylation sites.

amino acids 67-71, 325-329 

Tyrosine kinase phosphorylation sites. 

amino acids 152-159, 183-183 



N-myristoylation sites.

amino acids 89-95, 128-134 
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FIGURE 133 

CGGCAGTTCTGGCCCCTGCAGCTGGAGGTACCCTGAGTTCTGAGGGTCGTAGTGCTGTTTCTG 

GTATTCTCATCGCGGTCACCTCTACCGGTGTGGACAAGTAAAGTTTGAATCAGCTTCTCCATG 

GCCTGGGCACCAGTTCCCGGCTGAGCCATTTTCCTTTTGGCTAAAAGTCCCCGCCCAGAGGCC 

AATTCGTCGCGGCGGCGGTGGAGATCGCAGGTCGCTCAGGCTTGCAGATGGGTCAAGGGTTGT 

GGAGAGTGGTCAGAAACCAGCAGCTGCAACAAGAAGGCTACAGTGAGCAAGGCTACCTCACCA 

GAGAGCAGAGCAGGAGAATGGATGCGAGCAACATTTCTAACACCAATCATCGTAAACAAGTCC 

AAGGAGGCATTGACATATATCATCTTTTGAAGGCAAGGAAATCGAAAGAACAGGAAGGATTCA 

TTAATTTGGAAATGTTGCCTCCTGAGCTAAGCTTTACCATCTTGTCCTACCTGAATGCAACTG 

ACCTTTGCTTGGCTTCATGTGTTTGGCAGGACCTTGCGAATGATGAACTTCTCTGGCAAGGGT 

TGTGCAAATCCACTTGGGGTCACTGTTCCATATACAATAAGAACCCACCTTTAGGATTTTCTT 

TTAGAAAATTGTATATGCAGCTGGATGAAGGCAGCCTCACCTTTAATGCCAACCCAGATGAGG 

GAGTGAACTACTTTATGTCCAAGGGTATCCTGGATGATTCGCCAAAGGAAATAGCAAAGTTTA 

TCTTCTGTACAAGAACACTAAATTGGAAAAAACTGAGAATCTATCTTGATGAAAGGAGAGATG 

TCTTGGATGACCTTGTAACATTGCATAATTTTAGAAATCAGTTCTTGCCAAATGCACTGAGAG 

AATTTTTTCGTCATATCCATGCCCCTGAAGAGCGTGGAGAGTATCTTGAAACTCTTATAACAA 

AGTTCTCACATAGATTCTGTGCTTGCAACCCTGATTTAATGCGAGAACTTGGCCTTAGTCCTG 

ATGCTGTCTATGTACTGTGCTACTCTTTGATTCTACTTTCCATTGACCTCACTAGCCCTCATG 

TGAAGAATAAAATGTCAAAAAGGGAATTTATTCGAAATACCCGTCGCGCTGCTCAAAATATTA 

GTGAAGATTTTGTAGGGCATCTTTATGACAATATCTACCTTATTGGCCATGTGGCTGCATAAA

AAGCACAATTGCTAGGACTTCAGTTTTTACTTCAGACTAAAGCTACCCAAGGACTTAGCAGAT 

ATGGGGGTTACATCAGTGCTGGTCATTGTAGCCTGAGTATACAATCAAGCTTCAGTGTGCAAC 

CTTTTTTTCTTTTGCCATTTTCTATTTTAGTAATTTCCTTGGGGAACTAAATAATTTTGCAGA 

ATTTTTCCTAATTTTGTTTATCACGTTTTGCACAAAGCAGAGCCACTGTCTAACACAGCTGTT 

AACGAATGATAAACTGACATTATACTCTAAAAGATGGTGTATTTGTGCATTAGATTTGCCTGA 

AAAACTTTATCCATTTCCATTCTTTATACAAATACCATGTAATGTGTACATATTTAACTAAAG 

AGATTTATAGTCATAATTATTTTATTGTAAAGATTTTAACTAAAGTTTTTCCTTTTCTCTC 
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FIGURE 134 

MGQGLWRVVRNQQLQQEGYSEQGYLTREQSRRMDASNISNTNHRKQVQGGIDIYHLLKARKSK
EQEGFINLEMLPPELSFTILSYLNATDLCLASCVWQDLANDELLWQGLCKSTWGHCSIYNKNP 
PLGFSFRKLYMQLDEGSLTFNANPDEGVNYFMSKGILDDSPKEIAKFIFCTRTLNWKKLRIYL 
DERRDVLDDLVTLHNFRNQFLPNALREFFRHIHAPEERGEYLETLITKFSHRFCACNPDLMRE 
LGLSPDAVYVLCYSLILLSIDLTSPHVKNKMSKREFIRNTRRAAQNISEDFVGHLYDNIYLIG 

HVAA 

Important features of the protein: 
Transmembrane domain:

amino acids 253-272 

N-glycosylation sites.

amino acids 37-41, 87-91, 298-302 

N-myristoylation site. 

amino acids 110-116 
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FIGURE 135 

GGCACGAGGGAGCCTCCGTTAGGGGGTGGGAAAGGACTTTGCCATAGGTCGCTGAGGCCACCA 
TCTGCTCTCTTACTGGCCAAGGGCGTAAAAAGATAGTCTTCCCATTAGCTAGAGAGCAAACCC 
CAGAAAGCCTATTGGCTGCGCCGTCCGCGGGCCTTGGTCCGCTTTGAAGGCGGGCTGCGGCTG 
CGAGAGGAGGGCGGGCGGGAGGCTAGCTGTTGTCGTGGTTGCTCGGAGGCACGTGTGCAGTCC 
CGGAAGCGGCGAGGGGAAACTGCTCCGCGCGCGCCGCGGGAGGAGGAACCGCCCGGTCCTTTA 
GGGTCCGGGCCCGGCCGGGCCATGGATTCAATGCCTGAGCCCGCGTCCCGCTGTCTTCTGCTT
CTTCCCTTGCTGCTGCTGCTGCTGCTGCTGCTGCCGGCCCCGGAGCTGGGCCCGAGCCAGGCC 
GGAGCTGAGGAGAACGACTGGGTTCGCCTGCCCAGCAAATGCGAAGTGTGTAAATATGTTGCT 
GTGGAGCTGAAGTCAGCCTTTGAGGAAACCGGCAAGACCAAGGAGGTGATTGGCACGGGCTAT 
GGCATCCTGGACCAGAAGGCCTCTGGAGTCAAATACACCAAGTCGGACTTGCGGTTAATCGAA 
GTCACTGAGACCATTTGCAAGAGGCTCCTGGATTATAGCCTGCACAAGGAGAGGACCGGCAGC 
AATCGATTTGCCAAGGGCATGTCAGAGACCTTTGAGACATTACACAACCTGGTACACAAAGGG 
GTCAAGGTGGTGATGGACATCCCCTATGAGCTGTGGAACGAGACTTCTGCAGAGGTGGCTGAC 
CTCAAGAAGCAGTGTGATGTGCTGGTGGAAGAGTTTGAGGAGGTGATCGAGGACTGGTACAGG 
AACCACCAGGAGGAAGACCTGACTGAATTCCTCTGCGCCAACCACGTGCTGAAGGGAAAAGAC 
ACCAGTTGCCTGGCAGAGCAGTGGTCCGGCAAGAAGGGAGACACAGCTGCCCTGGGAGGGAAG 
AAGTCCAAGAAGAAGAGCAGCAGGGCCAAGGCAGCAGGCGGCAGGAGTAGCAGCAGCAAACAA 
AGGAAGGAGCTGGGTGGCCTTGAGGGAGACCCCAGCCCCGAGGAGGATGAGGGCATCCAGAAG 
GCATCCCCTCTCACACACAGCCCCCCTGATGAGCTCTGAGCCCACCCAGCATCCTCTGTCCTG 
AGACCCCTGATTTTGAAGCTGAGGAGTCAGGGGCATGGCTCTGGCAGGCCGGGATGGCCCCGC 
AGCCTTCAGCCCCTCCTTGCCTTGGCTGTGCCCTCTTCTGCCAAGGAAAGACACAAGCCCCAG 
GAAGAACTCAGAGCCGTCATGGGTAGCCCACGCCGTCCTTTCCCCTCCCCAAGTGTTTCTCTC 
CTGACCCAGGGTTCAGGCAGGCCTTGTGGTTTCAGGACTGCAAGGACTCCAGTGTGAACTCAG 
GAGGGGCAGGTGTCAGAACTGGGCACCAGGACTGGAGCCCCCTCCGGAGACCAAACTCACCAT 
CCCTCAGTCCTCCCCAACAGGGTACTAGGACTGCAGCCCCCTGTAGCTCCTCTCTGCTTACCC 
CTCCTGTGGACACCTTGCACTCTGCCTGGCCCTTCCCAGAGCCCAAAGAGTAAAAATGTTCTG 
GTTCTGATTTCTGAAAAAAAAAAAAAAAAAAATTCCT 
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FIGURE 136 

MDSMPEPASRCLLLLPLLLLLLLLLPAPELGPSQAGAEENDWVRLPSKCEVCKYVAVELKSAF 
EETGKTKEVIGTGYGILDQKASGVKYTKSDLRLIEVTETICKRLLDYSLHKERTGSNRFAKGM 
SETFETLHNLVHKGVKVVMDIPYELWNETSAEVADLKKQCDVLVEEFEEVIEDWYRNHQEEDL
TEFLCANHVLKGKDTSCLAEQWSGKKGDTAALGGKKSKKKSSRAKAAGGRSSSSKQRKELGGL 
EGDPSPEEDEGIQKASPLTHSPPDEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

N-glycosylation site. 

amino acids 153-157 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 227-231, 228-232 

Tyrosine kinase phosphorylation site. 

amino acids 142-150

N-myristoylation sites.

amino acids 36-42, 74-80, 86-92, 125-131, 222-228, 237-243, 
250-256, 263-269 

Amidation sites. 

amino acids 212-216, 222-226 

ATP/GTP-binding site motif A (P-loop).

amino acids 62-70 
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FIGURE 137 

CACGCCTCCCGCTGCCAGCCCGGCACCGGGATCTTAATCAGTCACTATGAAAACTCATTAGCT 
CCACAGCAATGAGTCCTCCACTGCTGAAGCTTGGCGCTGTGCTTAGTACCATGGCAATGATCT 
CAAACTGGATGTCCCAAACTCTCCCATCCTTGGTGGGACTGAACACCACGAGGCTGTCGACTC 
CGGATACCTTAACTCAGATTAGTCCTAAAGAAGGGTGGCAGGTGTACAGCTCAGCTCAGGATC 
CTGATGGGCGGTGCATTTGCACAGTTGTTGCTCCAGAACAAAACCTGTGTTCCCGGGATGCCA 
AAAGCAGGCAACTTCGCCAACTACTGGAAAAGGTTCAGAACATGTCCCAGTCTATTGAAGTCT 
TAAACTTGAGAACTCAGAGAGATTTCCAATATGTTTTAAAAATGGAAACCCAAATGAAAGGGC 
TGAAGGCAAAATTTCGGCAGATTGAAGATGATCGAAAGACACTTATGACCAAGCATTTTCAGG
AGTTGAAAGAGAAAATGGACGAGCTCCTGCCTTTGATCCCCGTGCTGGAACAGTACAAAACAG 
ATGCTAAGTTAATCACCCAGTTCAAGGAGGAAATAAGGAATCTGTCTGCTGTCCTCACTGGTA 
TTCAGGAGGAAATTGGTGCCTATGACTACGAGGAACTACACCAAAGAGTGCTGAGCTTGGAAA 
CAAGACTTCGTGACTGCATGAAAAAGCTAACATGTGGCAAACTGATGAAAATCACAGGCCCAG 
TTACAGTCAAGACATCTGGAACCCGATTTGGTGCTTGGATGACAGACCCTTTAGCATCTGAGA 
AAAACAACAGAGTCTGGTACATGGACAGTTATACTAACAATAAAATTGTTCGTGAATACAAAT 
CAATTGCAGACTTTGTCAGTGGGGCTGAATCAAGGACATACAACCTTCCTTTCAAGTGGGCAG 
GAACTAACCATGTTGTCTACAATGGCTCACTCTATTTTAACAAGTATCAGAGTAATATCATCA 
TCAAATACAGCTTTGATATGGGGAGAGTGCTTGCCCAACGAAGCCTGGAGTATGCTGGTTTTC 
ATAATGTTTACCCCTACACATGGGGTGGATTCTCTGACATCGACCTAATGGCTGATGAAATCG 
GGCTGTGGGCTGTGTATGCAACTAACCAGAATGCAGGCAATATTGTCATCAGCCAACTTAACC 
AAGATACCTTGGAGGTGATGAAGAGCTGGAGCAGTGGCTACCCCAAGAGAAGTGCAGGGGAAT 
CTTTCATGATCTGTGGGACACTGTATGTCACCAACTCCCACTTAACTGGAGCCAAGGTGTATT 
ATTCCTATTCCACCAAAACCTCCACATATGAGTACACAGACATTCCCTTCCATAACCAATACT 
TTCACATATCCATGCTTGACTACAATGCAAGAGATCGAGCTCTCTATGCCTGGAACAATGGCC 
ACCAGGTGCTGTTCAATGTCACCCTTTTCCATATCATCAAGACAGAGGATGACACATAGGCAA 
ATGTGACATGTTTTCATTGATTTAAACAGTGTGATTTGTGATAAACTCTATAAGACCCCTTCC 
GTTTTTTTCTTCACTATTATTTTTCATCATTTCTCCAAAGCAAAGCATTTTTATTGTAAAGTT 
GGTGTTTCAAAAACATAGCTGAGCTTGTCTAACTTACCATGTTGGAAACACATCTTAACTTCT 
AAATTTACAAGGCCTATCATGTCCTTGTCATGAAAAGCACTAAAAAAAAAAAAGAGTTTAAGT 
GGCTAAAGTCATAGTTTTGCAAGAGATTAATGATCTGCCTTATATTAGAGTCAGAGACTAATG 
GTGGCTTAAATGCACGAATGTCTTTTTTTTTAAAACTGTCATTTTTTACTGTCTTTTGCTCCA 
TCTCAGGAAATATTTTGGTAGGAATTAGGAGAACAAAAAGCACTTTTATCCCATTTATTTCTT 
TAAAAAATGTAAGGATTTCATTTATATTGAAAAATAATATTAATCATTTTGCTGTTAACACAA 
TTCTCTGATGCGGTGCTGTACAGTCATTTTTAAATCTCTTGCTAACATTTTATTGGCAGTATG 
TATTTCTACCATTGTAACCACCATTGTGCTATTGTATCTCTTCACTTCTGTGAAAGTAATATT 
TTTTATAAAANACACTGNAATTTTAAAAAAAAAAAAAAAAAACAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 138 

MSPPLLKLGAVLSTMAMISNWMSQTLPSLVGLNTTRLSTPDTLTQISPKEGWQVYSSAQDPDG 
RCICTVVAPEQNLCSRDAKSRQLRQLLEKVQNMSQSIEVLNLRTQRDFQYVLKMETQMKGLKA 
KFRQIEDDRKTLMTKHFQELKEKMDELLPLIPVLEQYKTDAKLITQFKEEIRNLSAVLTGIQE 
EIGAYDYEELHQRVLSLETRLRDCMKKLTCGKLMKITGPVTVKTSGTRFGAWMTDPLASEKNN 
RVWYMDSYTNNKIVREYKSIADFVSGAESRTYNLPFKWAGTNHVVYNGSLYFNKYQSNIIIKY 
SFDMGRVLAQRSLEYAGFHNVYPYTWGGFSDIDLMADEIGLWAVYATNQNAGNIVISQLNQDT 
LEVMKSWSTGYPKRSAGESFMICGTLYVTNSHLTGAKVYYSYSTKTSTYEYTDIPFHNQYFHI 

SMLDYNARDRALYAWNNGHQVLFNVTLFHIIKTEDDT 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 33-37, 95-99, 179-183, 299-303, 465-469 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 215-219 

Tyrosine kinase phosphorylation site. 

amino acids 106-114 

N-myristoylation sites. 

amino acids 9-15, 31-37, 235-241, 239-245 
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FIGURE 139 

GAAGCAGTGCAGAGAGGAGAGCGGAGCGGAGCTGCCGCTGAGCAAAGGCCTTCACCATGGCCG 

AGTCCCCCGGCTGCTGCTCCGTCTGGGCCCGCTGCCTCCACTGCCTGTATAGCTGCCACTGGA 

GGAAATGCCCCAGAGAGAGGATGCAAACCAGCAAGTGCGACTGTATCTGGTTTGGCCTGCTCT 

TCCTCACCTTCCTCCTTTCCCTGAGCTGGCTGTACATCGGGCTCGTCCTTCTCAATGACCTGC 

ACAACTTCAATGAATTCCTCTTCCGCCGCTGGGGACACTGGATGGACTGGTCCCTGGCATTCC 

TGCTGGTCATCTCTCTACTGGTCACATATGCATCCTTGCTATTGGTCCTGGCCCTGCTCCTGC 

GGCTTTGTAGACAGCCCCTGCATCTGCACAGCCTCCACAAGGTGCTGCTGCTCCTCATTATGC 

TGCTTGTGGCGGCTGGCCTTGTGGGACTGGACATCCAATGGCAGCAGGAGTGGCATAGCTTGC 

GTGTGTCACTGCAGGCCACAGCCCCATTCCTTCATATTGGAGCAGCCGCTGGAATTGCCCTCC 

TGGCCTGGCCTGTGGCTGATACCTTCTACCGTATCCAGCGAAGAGGTCCCAAGATTCTGCTAC 

TGCTCCTATTTTTTGGAGTTGTCCTGGTCATCTACTTGGCCCCCCTATGCATCTCCTCACCCT 

GCATCATGGAACCCAGAGACTTACCACCCAAGCCTGGGCTGGTGGGACACCGAGGGGCCCCCA 

TGCTGGCTCCCGAGAACACCCTGATGTCCTTGCGGAAGACAGCTGAATGCGGAGCTACTGTGT 

TTGAGACTGATGTGATGGTCAGCTCCGATGGGGTCCCCTTCCTCATGCATGATGAGCACCTCA 

GCAGGACCACGAATGTAGCCTCTGTATTCCCAACCCGAATCACAGCCCACAGCAGTGACTTCT 

CCTGGACTGAACTGAAGAGACTCAATGCTGGATCCTGGTTCCTAGAGAGGCGACCCTTCTGGG 

GGGCCAAACCGCTGGCAGGCCCTGATCAGAAAGAGGCTGAGAGTCAGACGGTACCAGCATTAG 

AAGAGCTATTGGAGGAAGCTGCAGCCCTCAACCTTTCCATCATGTTCGACTTGCGCCGACCCC 

CACAGAACCACACATACTATGACACTTTTGTGATCCAGACATTGGAGACTGTGCTGAATGCAA 

GGGTGCCCCAAGCCATGGTCTTTTGGCTACCAGATGAAGATCGGGCTAATGTCCAACGACGGG 

CACCTGGAATGCGCCAGATATATGGACGTCAGGGAGGCAACAGAACGGAGAGGCCCCAGTTTC 

TTAACCTCCCCTATCAAGATCTGCCACTATTGGATATCAAGGCATTGCATAAGGATAATGTCT 

CGGTGAACCTATTTGTAGTGAACAAGCCCTGGCTCTTCTCTCTGCTTTGGTGTGCAGGGGTGG 

ATTCGGTCACCACCAACGACTGCCAGCTGCTGCAGCAGATGCGTTACCCTATCTGGCTTATTA 

CCCCTCAAACCTACCTAATCATATGGGTCATTACCAATTGTGTTTCCACCATGCTGCTTTTGT 

GGACCTTCCTCCTCCAAAGGAGATTTGTTAAGAAGAGAGGGAAAACTGGCTTAGAAACAGCAG 

TGCTGCTGACAAGGATCAACAATTTCATGATGGAGISAATGCCCTGCCCTGCTTCCCCACCCA 

AGCCAGTCTACATTGCCCAAACAGCAAGGGTTGGAGAGTGGCTTAAGTGGAATGCTTCAGGGG 

TGGTGGGTTGCAAGTGGGGGGAGCTTTGCCAACAGGAGGTTTTGAACCATGAGGGCCCTCTGC 

CCAGGTGATGGGCATTCCCTAAGCTGCTATGGAATCTGCTCCCTTTGGGGTTTTGACCTGAGA 

TGTTTGGGAAGAGAGTGAGTAATGAGAAGTTTCTCCTCAAATGAAACTAGAACAGAGGAAGTA 

AAAGGGAGATTGCTCGGA 
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FIGURE 140 

MAESPGCCSVWARCLHCLYSCHWRKCPRERMQTSKCDCIWFGLLFLTFLLSLSWLYIGLVLLN 
DLHNFNEFLFRRWGHWMDWSLAFLLVISLLVTYASLLLVLALLLRLCRQPLHLHSLHKVLLLL 
IMLLVAAGLVGLDIQWQQEWHSLRVSLQATAPFLHIGAAAGIALLAWPVADTFYRIHRRGPKI 
LLLLLFFGVVLVIYLAPLCISSPCIMEPRDLPPKPGLVGHRGAPMLAPENTLMSLRKTAECGA
TVFETDVMVSSDGVPFLMHDEHLSRTTNVASVFPTRITAHSSDFSWTELKRLNAGSWFLERRP 
FWGAKPLAGPDQKEAESQTVPALEELLEEAAALNLSIMFDLRRPPQNHTYYDTFVIQTLETVL 
NARVPQAMVFWLPDEDRANVQRRAPGMRQIYGRQGGNRTERPQFLNLPYQDLPLLDIKALHKD 
NVSVNLFVVNKPWLFSLLWCAGVDSVTTNDCQLLQQMRYPIWLITPQTYLIIWVITNCVSTML

LLWTFLLQRRFVKKRGKTGLETAVLLTRINNFMME 

Important features of the protein: 
Transmembrane domains:

amino acids 38-60, 83-107, 122-138, 156-173, 189-210, 484-506 
N-glycosylation sites.

amino acids 349-353, 362-366, 415-419, 442-446 

N-myristoylation sites. 

amino acids 163-169, 413-419, 523-529 

Leucine zipper pattern. 

amino acids 93-115, 109-131 

Glutamine amidotransferases class-II active site.

amino acids 1-13 
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FIGURE 141 

GCCGCCGGCCCGGGCTGGAGCCGAGCGCAGCAGCCACCGCCGCCGCCGCGCCAGAAGTTTGGGTTGAACCGGAGC 

TGCCGGGAGGAAACTTTTTTCTTTTTTCCCCCTCCCTCCCGGGAGGAGGAGGAGGAGGAGGAGGGGAAGCTGCCG 

CCGGCGCCAAGGCTCGTGGGCTCGGGGTCGGCGCGGCCCGCAGAAGGGGCGGGGGCCTCGCCCCGCGAGGGGAGG 

CGCGCCCCGGGGGCCCCGAGAGGGGCGGTGAGGACCGCGGGCTGCTGGTGCGGCGGCGGCGGCGCGTGTGCCCCG 

CGCAGGGGAGGGCGCCCGCCCCGCTCCCGGCCCGGCTGCGAGGAGGAGGCGGCGGCGGCGCAGGAGGATGTACTT 

GGTGGCGGGGGACAGGGGGTTGGCCGGCTGCGGGCACCTCCTGGTCTCGCTGCTGGGGCTGCTGCTGCTGCTGGC 

GCGCTCCGGCACCCGGGCGCTGGTCTGCCTGCCCTGTGACGAGTCCAAGTGCGAGGAGCCCAGGAACTGCCCGGG 

GAGCATCGTGCAGGGCGTCTGCGGCTGCTGCTACACGTGCGCCAGCCAGAGGAACGAGAGCTGCGGCGGCACCTT 

CGGGATTTACGGAACCTGCGACCGGGGGCTGCGTTGTGTCATCCGCCCCCCGCTCAATGGCGACTCCCTCACCGA 

GTACGAAGCGGGCGTTTGCGAAGATGAGAACTGGACTGATGACCAACTGCTTGGTTTTAAACCATGCAATGAAAA 

CCTTATTGCTGGCTGCAATATAATCAATGGGAAATGTGAATGTAACACCATTCGAACCTGCAGCAATCCCTTTGA 

GTTTCCAAGTCAGGATATGTGCCTTTCAGCTTTAAAGAGAATTGAAGAAGAGAAGCCAGATTGCTCCAAGGCCCG 

CTGTGAAGTCCAGTTCTCTCCACGTTGTCCTGAAGATTCTGTTCTGATCGAGGGTTATGCTCCTCCTGGGGAGTG 

CTGTCCCTTACCCAGCCGCTGCGTGTGCAACCCCGCAGGCTGTCTGCGCAAAGTCTGCCAGCCGGGAAACCTGAA 

CATACTAGTGTCAAAAGCCTCAGGGAAGCCGGGAGAGTGCTGTGACCTCTATGAGTGCAAACCAGTTTTCGGCGT 

GGACTGCAGGACTGTGGAATGCCCTCCTGTTCAGCAGACCGCGTGTCCCCCGGACAGCTATGAAACTCAAGTCAG 

ACTAACTGCAGATGGTTGCTGTACTTTGCCAACAAGATGCGAGTGTCTCTCTGGCTTATGTGGTTTCCCCGTGTG 

TGAGGTGGGATCCACTCCCCGCATAGTCTCTCGTGGCGATGGGACACCTGGAAAGTGCTGTGATGTCTTTGAATG 

TGTTAATGATACAAAGCCAGCCTGCGTATTTAACAATGTGGAATATTATGATGGAGACATGTTTCGAATGGACAA 

CTGTCGGTTCTGTCGATGCCAAGGGGGCGTTGCCATCTGCTTCACTGCCCAGTGTGGTGAGATAAACTGCGAGAG 

GTACTACGTGCCCGAAGGAGAGTGCTGCCCAGTGTGTGAAGATCCAGTGTATCCTTTTAATAATCCCGCTGGCTG 

CTATGCCAATGGCCTGATCCTTGCCCACGGAGACCGGTGGCGGGAAGACGACTGCACATTCTGCCAGTGCGTCAA 

CGGTGAACGCCACTGCGTTGCGACCGTCTGCGGACAGACCTGCACAAACCCTGTGAAAGTGCCTGGGGAGTGTTG 

CCCTGTGTGCGAAGAACCAACCATCATCACAGTTGATCCACCTGCATGTGGGGAGTTATCAAACTGCACTCTGAC 

AGGGAAGGACTGCATTAATGGTTTCAAACGCGATCACAATGGTTGTCGGACCTGTCAGTGCATAAACACCGAGGA 

ACTATGTTCAGAAGGTAAACAAGGCTGCACCTTGAACTGTCCCTTCGGTTTCCTTACTGATGCCCAAAACTGTGA 

GATCTGTGAGTGCCGCCCAAGGCCCAAGAAGTGCAGACCCATAATCTGTGACAAGTATTGTCCACTTGGATTGCT 

GAAGAATAAGCACGGCTGTGACATCTGTCGCTGTAAGAAATGTCCAGAGCTCTCATGCAGTAAGATCTGCCCCTT 

GGGTTTCCAGCAGGACAGTCACGGCTGTCTTATCTGCAAGTGCAGAGAGGCCTCTGCTTCAGCTGGGCCACCCAT 

CCTGTCGGGCACTTGTCTCACCGTGGATGGTCATCATCATAAAAATGAGGAGAGCTGGCACGATGGGTGCCGGGA 

ATGCTACTGTCTCAATGGACGGGAAATGTGTGCCCTGATCACCTGCCCGGTGCCTGCCTGTGGCAACCCCACCAT 

TCACCCTGGACAGTGCTGCCCATCATGTGCAGATGACTTTGTGGTGCAGAAGCCAGAGCTCAGTACTCCCTCCAT 

TTGCCACGCCCCTGGAGGAGAATACTTTGTGGAAGGAGAAACGTGGAACATTGACTCCTGTACTCAGTGCACCTG 

CCACAGCGGACGGGTGCTGTGTGAGACAGAGGTGTGCCCACCGCTGCTCTGCCAGAACCCCTCACGCACCCAGGA 

TTCCTGCTGCCCACAGTGTACAGATCAACCTTTTCGGCCTTCCTTGTCCCGCAATAACAGCGTACCTAATTACTG 

CAAAAATGATGAAGGGGATATATTCCTGGCAGCTGAGTCCTGGAAGCCTGACGTTTGTACCAGCTGCATCTGCAT 

TGATAGCGTAATTAGCTGTTTCTCTGAGTCCTGCCCTTCTGTATCCTGTGAAAGACCTGTCTTGAGAAAAGGCCA 

GTGTTGTCCCTACTGCATAGAAGACACAATTCCAAAGAAGGTGGTGTGCCACTTCAGTGGGAAGGCCTATGCCGA 

CGAGGAGCGGTGGGACCTTGACAGCTGCACCCACTGCTACTGCCTGCAGGGCCAGACCCTCTGCTCGACCGTCAG 

CTGCCCCCCTCTGCCCTGTGTTGAGCCCATCAACGTGGAAGGAAGTTGCTGCCCAATGTGTCCAGAAATGTATGT 

CCCAGAACCAACCAATATACCCATTGAGAAGACAAACCATCGAGGAGAGGTTGACCTGGAGGTTCCCCTGTGGCC 

CACGCCTAGTGAAAATGATATCGTCCATCTCCCTAGAGATATGGGTCACCTCCAGGTAGATTACAGAGATAACAG 

GCTGCACCCAAGTGAAGATTCTTCACTGGACTCCATTGCCTCAGTTGTGGTTCCCATAATTATATGCCTCTCTAT 

TATAATAGCATTCCTATTCATCAATCAGAAGAAACAGTGGATACCACTGCTTTGCTGGTATCGAACACCAACTAA 

GCCTTCTTCCTTAAATAATCAGCTAGTATCTGTGGACTGCAAGAAAGGAACCAGAGTCCAGGTGGACAGTTCCCA 

GAGAATGCTA^GAATTGCAGAACCAGATGCAAGATTCAGTGGCTTCTACAGCATGCAAAAACAGAACCATCTACA 

GGCAGACAATTTCTACCAAACAGTGTGAAGAAAGGCAACTAGGATGAGGTTTCAAAAGACGGAAGACGACTAAAT 

CTGCTCTAAAAAGTAAACTAGAATTTGTGCACTTGCTTAGTGGATTGTATTGGATTGTGACTTGATGTACAGCGC 

TAAGACCTTACTGGGATGGGCTCTGTCTACAGCAATGTGCAGAACAAGCATTCCCACTTTTCCTCAAAAAA 
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FIGURE 142 

MYLVAGDRGLAGCGHLLVSLLGLLLLLARSGTRALVCLPCDESKCEEPRNCPGSIVQGVCGCC 
YTCASQRNESCGGTFGIYGTCDRGLRCVIRPPLNGDSLTEYEAGVCEDENWTDDQLLGFKPCN 
ENLIAGCNIINGKCECNTIRTCSNPFEFPSQDMCLSALKRIEEEKPDCSKARCEVQFSPRCPE 
DSVLIEGYAPPGECCPLPSRGVCNPAGCLRKVCQPGNLNILVSKASGKPGECCDLYECKPVFG 
VDCRTVECPPVQQTACPPDSYETQVRLTADGCCTLPTRCECLSGLCGFPVCEVGSTPRIVSRG 
DGTPGKCCDVFECVNDTKPACVFNNVEYYDGDMFRMDNCRFCRCQGGVAICFTAQCGEINCER 
YYVPEGECCPVCEDPVYPFNNPAGCYANGLILAHGDRWREDDCTFCQCVNGERHCVATVCGQT 
CTNPVKVPGECCPVCEEPTIITVDPPACGELSNCTLTGKDCINGFKRDHNGCRTCQCINTEEL 
CSERKQGCTLNCPFGFLTDAQNCEICECRPRPKKCRPIICDKYCPLGLLKNKHGCDICRCKKC 
PELSCSKICPLGFQQDSHGCLICKCREASASAGPPILSGTCLTVDGHHHKNEESWHDGCRECY 
CLNGREMCALITCPVPACGNPTIHPGQCCPSCADDFVVQKPELSTPSICHAPGGEYFVEGETW
NIDSCTQCTCHSGRVLCETEVCPPLLCQNPSRTQDSCCPQCTDQPFRPSLSRNNSVPNYCKND 
EGDIFLAAESWKPDVCTSCICIDSVISCFSESCPSVSCERPVLRKGQCCPYCIEDTIPKKVVC
HFSGKAYADEERWDLDSCTHCYCLQGQTLCSTVSCPPLPCVEPINVEGSCCPMCPEMYVPEPT 
NIPIEKTNHRGEVDLEVPLWPTPSENDIVHLPRDMGHLQVDYRDNRLHPSEDSSLDSIASVVV 
PIIICLSIIIAFLFINQKKQWIPLLCWYRTPTKPSSLNNQLVSVDCKKGTRVQVDSSQRMLRI
AEPDARFSGFYSMQKQNHLQADNFYQTV 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 
Transmembrane domain:
amino acids 940-962
N-glycosylation sites.

amino acids 71-75, 113-117, 330-334, 474-478, 746-750
cAMP- and cGMP-dependent protein kinase phosphorylation site.
amino acids 992-996 
N-myristoylation site. 

amino acids 9-15, 58-64, 61-67, 75-81, 79-85, 362-368, 402-408, 407-413, 
439-445, 492-498, 511-517, 551-557, 558-564, 586-592, 606-612, 625-631, 
845-851 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 52-63, 844-855 
Cell attachment sequence.
amino acids 314-317 
Leucine zipper pattern. 

amino acids 3-25 

Eukaryotic thiol (cysteine) proteases cysteine active site. 

amino acids 57-69 

VWFC domain proteins. 

amino acids 448-456, 382-390 

C-terminal cystine knot proteins 

amino acids 60-86 
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FIGURE 143 

GACGTCTGGCCGGCTCCCGGCGAAGGGCAGCGGAGGAGCGGCCCAGAGCGCGCAGCTAGGGCA 
CTGGCGAAACCCCGGGACAGTCCCTCTCCGTGCGGGGGCGGCGCAGAGCAGTCCCATCCCCGG 
GGTCCCGGGCGCGGCTGACTGCCGGCTGGTTCCCTGCGCGCAGTAGCTCCCCGAGCCGGGCTG 
CACCGGAGGCGGCGAGATGGTCGCGCGCGTCGGCCTCCTGCTGCGCGCCCTGCAGCTGCTACT 
GTGGGGCCACCTGGACGCCCAGCCCGCGGAGCGCGGAGGCCAGGAGCTGCGCAAGGAGGCGGA 
GGCATTCCTAGAGAAGTACGGATACCTCAATGAACAGGTCCCCAAAGCTCCCACCTCCACTCG 
ATTCAGCGATGCCATCAGAGCGTTTCAGTGGGTGTCCCAGCTACCTGTCAGCGGCGTGTTGGA 
CCGCGCCACCCTGCGCCAGATGACTCGTCCCCGCTGCGGGGTTACAGATACCAACAGTTATGC 
GGCCTGGGCTGAGAGGATCAGTGACTTGTTTGCTAGACACCGGACCAAAATGAGGCGTAAGAA 
ACGCTTTGCAAAGCAAGGTAACAAATGGTACAAGCAGCACCTCTCCTACCGCCTGGTGAACTG 
GCCTGAGCATCTGCCGGAGCCGGCAGTTCGGGGCGCCGTGCGCGCCGCCTTCCAGTTGTGGAG 
CAACGTCTCAGCGCTGGAGTTCTGGGAGGCCCCAGCCACAGGCCCCGCTGACATCCGGCTCAC 
CTTCTTCCAAGGGGACCACAACGATGGGCTGGGCAATGCCTTTGATGGCCCAGGGGGCGCCCT 
GGCGCACGCCTTCCTGCCCCGCCGCGGCGAAGCGCACTTCGACCAAGATGAGCGCTGGTCCCT 
GAGCCGCCGCCGCGGGCGCAACCTGTTCGTGGTGCTGGCGCACGAGATCGGTCACACGCTTGG 
CCTCACCCACTCGCCCGCGCCGCGCGCGCTCATGGCGCCCTACTACAAGAGGCTGGGCCGCGA 
CGCGCTGCTCAGCTGGGACGACGTGCTGGCCGTGCAGAGCCTGTATGGGAAGCCCCTAGGGGG 
CTCAGTGGCCGTCCAGCTCCCAGGAAAGCTGTTCACTGACTTTGAGACCTGGGACTCCTACAG 
CCCCCAAGGAAGGCGCCCTGAAACGCAGGGCCCTAAATACTGCCACTCTTCCTTCGATGCCAT 
CACTGTAGACAGGCAACAGCAACTGTACATTTTTAAAGGGAGCCATTTCTGGGAGGTGGCAGC 
TGATGGCAACGTCTCAGAGCCCCGTCCACTGCAGGAAAGATGGGTCGGGCTGCCCCCCAACAT 
TGAGGCTGCGGCAGTGTCATTGAATGATGGAGATTTCTACTTCTTCAAAGGGGGTCGATGCTG 
GAGGTTCCGGGGCCCCAAGCCAGTGTGGGGTCTCCCACAGCTGTGCCGGGCAGGGGGCCTGCC 
CCGCCATCCTGACGCCGCCCTCTTCTTCCCTCCTCTGCGCCGCCTCATCCTCTTCAAGGGTGC 
CCGCTACTACGTGCTGGCCCGAGGGGGACTGCAAGTGGAGCCCTACTACCCCCGAAGTGTGCA 
GGACTGGGGAGGCATCCCTGAGGAGGTCAGCGGCGCCCTGCCGAGGCCCGATGGCTCCATCAT 
CTTCTTCCGAGATGACCGCTACTGGCGCCTCGACCAGGCCAAACTGCAGGCAACCACCTCGGG 
CCGCTGGGCCACCGAGCTGCCCTGGATGGGCTGCTGGCATGCCAACTCGGGGAGCGCCCTGTT 
CTGAAGGCACCTCCTCACCTCAGAAACTGGTGGTGCTCTCAGGGCAAAATCATGTTCCCCACC 
CCCGGGGCAGAACCCCTCTTAGAAGCCTCTGAGTCCCTCTGCAGAAGACCGGGCAGCAAAGCC 
TCCATCTGGAAGTCTGTCTGCCTTTGTTCCTTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 144 

MVARVGLLLRALQLLLWGHLDAQPAERGGQELRKEAEAFLEKYGYLNEQVPKAPTSTRFSDAI 
RAFQWVSQLPVSGVLDRATLRQMTRPRCGVTDTNSYAAWAERISDLFARHRTKMRRKKRFAKQ 
GNKWYKQHLSYRLVNWPEHLPEPAVRGAVRAAFQLWSNVSALEFWEAPATGPADIRLTFFQGD 
HNDGLGNAFDGPGGALAHAFLPRRGEAHFDQDERWSLSRRRGRNLFVVLAHEIGHTLGLTHSP 
APRALMAPYYKRLGRDALLSWDDVLAVQSLYGKPLGGSVAVQLPGKLFTDFETWDSYSPQGRR 
PETQGPKYCHSSFDAITVDRQQQLYIFKGSHFWEVAADGNVSEPRPLQERWVGLPPNIEAAAV 
SLNDGDFYFFKGGRCWRFRGPKPVWGLPQLCRAGGLPRHPDAALFFPPLRRLILFKGARYYVL 
ARGGLQVEPYYPRSLQDWGGIPEEVSGALPRPDGSIIFFRDDRYWRLDQAKLQATTSGRWATE 

LPWMGCWHANSGSALF

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation sites. 

amino acids 164-168, 355-359 

N-myristoylation sites. 

amino acids 92-98, 153-159, 193-199, 202-208, 288-294, 368-374, 
509-515 

Amidation site. 

amino acids 312-316 

Neutral zinc metallopeptidases, zinc-binding region signature.

amino acids 237-247 

Matrixins cysteine switch 

amino acids 231-262, 271-284 



Hemopexin domain protein 

amino acids 66-108, 231-262 
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FIGURE 145 

GCCGGCTAGGGCGCCGGAGCCGCACGCAGCCGCGGGGCTCCGAGAGGCGCGCACTGGGGCTGGGACTGCGCGGCG 

CCGCCGCTGCGAGCGCCACTGAGCGGTCGCGCAACTTCGGAGGCACAGCGCCGGAGCCAGGCGAGCGCTCAGAGA 

CCCGGAGCCAGAGGGGCGCGCCGGAGCCTCGTTCGAGAGCCGGCGCCAGGCACCCACCGCGCTCCGAGTGCCAGG 

CGGCCCTCCGCGCAGCGTGGCTTCCGCTGCCCCCACGGAAGGCACGGGCTGGCGCTGCCGGGCGCCGGGGAGGAC 

GGCGAGGAGGAGGCGGCGGCGGCGGAGACGGCGGCGGCGAGACTGGGGCCAGGGAGACAGCCCTGGGGGAGAGGC 

GCCCGAACCAGGCCGCGGGAGCATGGGGGCCCGGAGCGGAGCTCGGGGCGCGCTGCTGCTGGCACTGCTGCTCTG 

CTGGGACCCGAGGCTGAGCCAAGCAGGCACTGATTCTGGCAGCGAGGTGCTCCCTGACTCCTTCCCGTCAGCGCC 

AGCAGAGCCGCTGCCCTACTTCCTGCAGGAGCCACAGGACGCCTACATTGTGAAGAACAAGCCTGTGGAGCTCCG 

CTGCCGCGCCTTCCCCGCCACACAGATCTACTTCAAGTGCAACGGCGAGTGGGTCAGCCAGAACGACCACGTCAC 

ACAGGAAGGCCTGGATGAGGCCACCGGCCTGCGGGTGCGCGAGGTGCAGATCGAGGTGTCGCGGCAGCAGGTGGA 

GGAGCTCTTTGGGCTGGAGGATTACTGGTGCCAGTGCGTGGCCTGGAGCTCCGCAGGCACCACCAAGAGTCGCCG 

AGCCTACGTCCGCATCGCCTACCTGCGCAAGAACTTCGATCAGGAGCCTCTGGGCAAGGAGGTGCCCCTGGACCA 

TGAGGTTCTCCTGCAGTGCCGCCCGCCGGAGGGGGTGCCTGTGGCCGAGGTGGAATGGCTCAAGAATGAGGATGT 

CATCGACCCCACCCAGGACACCAACTTCCTGCTCACCATCGACCACAACCTCATCATCCGCCAGGCCCGCCTGTC 

GGACACTGCCAACTATACCTGCGTGGCCAAGAACATCGTGGCCAAACGCCGGAGCACCACTGCCACCGTCATCGT 

CTACGTGAATGGCGGCTGGTCCAGCTGGGCAGAGTGGTCACCCTGCTCCAACCGCTGTGGCCGAGGCTGGCAGAA 

GCGCACCCGGACCTGCACCAACCCCGCTCCACTCAACGGAGGGGCCTTCTGCGAGGGCCAGGCATTCCAGAAGAC 

CGCCTGCACCACCATCTGCCCAGTCGATGGGGCGTGGAGGGAGTGGAGCAAGTGGTCAGCCTGCAGCACTGAGTG 

TGCCCACTGGCGTAGCCGCGAGTGCATGGCGCCCCCACCCCAGAACGGAGGCCGTGACTGCAGCGGGACGCTGCT 

CGACTCTAAGAACTGCACAGATGGGCTGTGCATGCAAAATAAGAAAACTCTAAGCGACCCCAACAGCCACCTGCT 

GGAGGCCTCAGGGGATGCGGCGCTGTATGCGGGGCTCGTGGTGGCCATCTTCGTGGTCGTGGCAATCCTCATGGC 

GGTGGGGGTGGTGGTGTACCGCCGCAACTGCCGTGACTTCGACACAGACATCACTGACTCATCTGCTGCCCTGAC 

TGGTGGTTTCCACCCCGTCAACTTTAAGACGGCAAGGCCCAGCAACCCGCAGCTCCTACACCCCTCTGTGCCTCC 

TGACCTGACAGCCAGCGCCGGCATCTACCGCGGACCCGTGTATGCCCTGCAGGACTCCACCGACAAAATCCCCAT 

GACCAACTCTCCTCTGCTGGACCCCTTACCCAGCCTTAAGGTCAAGGTCTACAGCTCCAGCACCACGGGCTCTGG 

GCCAGGCCTGGCAGATGGGGCTGACCTGCTGGGGGTCTTGCCGCCTGGCACATACCCTAGCGATTTCGCCCGGGA 

CACCCACTTCCTGCACCTGCGCAGCGCCAGCCTCGGTTCCCAGCAGCTCTTGGGCCTGCCCCGAGACCCAGGGAG 

CAGCGTCAGCGGCACCTTTGGCTGCCTGGGTGGGAGGCTCAGCATCCCCGGCACAGGGGTCAGCTTGCTGGTGCC 

CAATGGAGCCATTCCCCAGGGCAAGTTCTACGAGATGTATCTACTCATCAACAAGGCAGAAAGTACCCTCCCGCT 

TTCAGAAGGGACCCAGACAGTATTGAGCCCCTCGGTGACCTGTGGACCCACAGGCCTCCTGCTGTGCCGCCCCGT 

CATCCTCACCATGCCCCACTGTGCCGAAGTCAGTGCCCGTGACTGGATCTTTCAGCTCAAGACCCAGGCCCACCA 

GGGCCACTGGGAGGAGGTGGTGACCCTGGATGAGGAGACCCTGAACACACCCTGCTACTGCCAGCTGGAGCCCAG 

GGCCTGTCACATCCTGCTGGACCAGCTGGGCACCTACGTGTTCACGGGCGAGTCCTATTCCCGCTCAGCAGTCAA 

GCGGCTCCAGCTGGCCGTCTTCGCCCCCGCCCTCTGCACCTCCCTGGAGTACAGCCTCCGGGTCTACTGCCTGGA 

GGACACGCCTGTAGCACTGAAGGAGGTGCTGGAGCTGGAGCGGACTCTGGGCGGATACTTGGTGGAGGAGCCGAA 

ACCGCTAATGTTCAAGGACAGTTACCACAACCTGCGCCTCTCCCTCCATGACCTCCCCCATGCCCATTGGAGGAG 

CAAGCTGCTGGCCAAATACCAGGAGATCCCCTTCTATCACATTTGGAGTGGCAGCCAGAAGGCCCTCCACTGCAC 

TTTCACCCTGGAGAGGCACAGCTTGGCCTCCACAGAGCTCACCTGCAAGATCTGCGTGCGGCAAGTGGAAGGGGA 

GGGCCAGATATTCCAGCTGCATACCACTCTGGCAGAGACACCTGCTGGCTCCCTGGACACTCTCTGCTCTGCCCC 

TGGCAGCACTGTCACCACCCAGCTGGGACCTTATGCCTTCAAGATCCCACTGTCCATCCGCCAGAAGATATGCAA 

CAGCCTAGATGCCCCCAACTCACGGGGCAATGACTGGCGGATGTTAGCACAGAAGCTCTCTATGGACCGGTACCT 

GAATTACTTTGCCACCAAAGCGAGCCCCACGGGTGTGATCCTGGACCTCTGGGAAGCTCTGCAGCAGGACGATGG

GGACCTCAACAGCCTGGCGAGTGCCTTGGAGGAGATGGGCAAGAGTGAGATGCTGGTGGCTGTGGCCACCGACGG 

GGACTGCTGAGCCTCCTGGGACAGCGGGCTGGCAGGGACTGGCAGGAGGCAGGTGCAGGGAGGCCTGGGGCAGCC 

TCCTGATGGGGATGTTTGGCCTCTGCTTCCTCCCAGTTCACAGCCAGAGTTGCCTCTCCTCCTCCTCTTCCCCAA 

CCCCCAGACCATGACCAGCCTTAGAAAATCCATGTACTCTGTTGTTAGAGGGCCCAGAGTTCCTTCTCCACCCCC 

GCTCTCTCTCTCTTGGCCTGAGATCTCTGTGCAGGAACCAAGATGGGGCTGAAGCCTCTGGAGGCAGTTGGTTGG 

GGGCGGGCAGGCAGGAGGCCCTCCCTCCACCCCCCCACCCTCAGCCCGGCAACTTCTGGGTTCCGTGGGTTTTAG 

TTCCGTTCTTCGTTTTCTTCCTCCGTTATTGATTTCTCCTTTCTCCCTAAGCCCCCTTCTGCTTCCACGCCCTTT 

TCCTCTTTGAAGAGTCAAGTAC7VATTCAGACAAACTGCTTTCTCCTGTCCAAAAGCAAAAAGGCAAAGGAAAGAA 

AGAAAGCTTCAGACCGCTAGTAAGGCTCAAAGAAGAAGAAAAACACCAAAACCACAAGGGAAAAGAAAAACCCAG 

TTTCTTAGGAAACGCAAACGATTTATTATCCAGATTATTTGGATAAGTCCTTTTTAAAA 
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146/550 

FIGURE 146 

MGARSGARGALLIALLLCWDPRLSQAGTDSGSEVLPDSFPSAPAEPLPYFLQEPQDAYIVKNK 
PVELRCRAFPATQIYFKCNGEWVSQNDHVTQEGLDEATGLRVREVQIEVSRQQVEELFGLEDY 
WCQCVAWSSAGTTKSRRAYVRIAYLRKNFDQEPLGKEVPLDHEVLLQCRPPEGVPVAEVEWLK 
NEDVIDPTQDTNFLLTIDHNLIIRQARLSDTANYTCVAKNIVAKRRSTTATVIVYVNGGWSSW 
AEWSPCSNRCGRGWQKRTRTCTNPAPLNGGAFCEGQAFQKTACTTICPVDGAWTEWSKWSACS 
TECAHWRSRECMAPPPQNGGRDCSGTLLDSKNCTDGLCMQNKKTLSDPNSHLLEASGDAALYA 
GLVVAIFVVVAILMAVGVVVYRRNCRDFDTDITDSSAALTGGFHPVNFKTARPSNPQLLHPSV
PPDLTASAGIYRGPVYALQDSTDKIPMTNSPLLDPLPSLKVKVYSSSTTGSGPGLADGADLLG 
VLPPGTYPSDFARDTHFLHLRSASLGSQQLLGLPRDPGSSVSGTFGCLGGRLSIPGTGVSLLV 
PNGAIPQGKFYEMYLLINKAESTLPLSEGTQTVLSPSVTCGPTGLLLCRPVILTMPHCAEVSA 
RDWIFQLKTQAHQGHWEEVVTLDEETLNTPCYCQLEPRACHILLDQLGTYVFTGESYSRSAVK
RLQLAVFAPALCTSLEYSLRVYCLEDTPVALKEVLELERTLGGYLVEEPKPLMFKDSYHNLRL 
SLHDLPHAHWRSKLLAKYQEIPFYHIWSGSQKALHCTFTLERHSLASTELTCKICVRQVEGEG 
QIFQLHTTLAETPAGSLDTLCSAPGSTVTTQLGPYAFKIPLSIRQKICNSLDAPNSRGNDWRM 
LAQKLSMDRYLNYFATKASPTGVILDLWEALQQDDGDLNSLASALEEMGKSEMLVAVATDGDC 

Important features of the protein: 

Signal peptide: 

amino acids 1-26

Transmembrane domain: 

amino acids 374-395 

N-glycosylation sites. 

amino acids 222-225, 347-350 
Glycosaminoglycan attachment site. 

amino acids 492-495

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 233-236, 234-237 

Casein kinase II phosphorylation sites. 

amino acids 30-33, 87-90, 251-254, 341-344, 359-362, 629-632, 651-654, 706- 
709, 757-760, 827-830, 925-928, 941-944 
Tyrosine kinase phosphorylation sites.
amino acids 216-223, 773-780 
N-myristoylation sites. 

amino acids 2-7, 6-11, 27-32, 96-101, 137-142, 179-184, 247-252, 281-286,
334-339, 379-384, 491-496, 495-500, 509-514, 542-547, 547-552, 550-555, 553-558,
560-565, 611-616, 785-790, 834-839, 844-849
Prokaryotic membrane lipoprotein lipid attachment site.
amino acids 541-551 

ATP/GTP-binding site motif A (P-loop).

amino acids 926-933 

Growth factor and cytokines receptors family signature 2. 

amino acids 306-312 
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147/550 
FIGURE 147 

GAGAGGGACAGAGGCTGGAGAAGGATGTATGGCCTGCCCTGGGCTTGTCTGTTCCCTCCTGAGCCTGAGCCCCTT 
ACCTTCCTGACCCCATGAAGCACACACTGGCTCTGCTGGCTCCCCTGC 

rTrAGCTGGCTGCAGGGGCCACAGACTGCAAGTTCCTTGGCCCGGCAGAGCACCTGACATTCACCCCAGCAGCCA 

ggg?ccgg?gg?Sg?S 

TCTCGGTGGTGCAGCTCAATCCTTTCCCTTCAGAGTTGGTAAAGGCCCTACTGAATGAGCTGGCCTCCGTGAAGG 

tgSgaggtggtgcggtacgaggcgggctacgtggtatgcgctgtgatcgcgggcctctacctgctg 

CCMTGCCGGGCTTTGCTTCTGCTGCTGCCGCTGCCACCGGCGCTGCGGGGGACGAGTGAAGACAGAGC 

cgctgIcctctgagcgcgcggccctcatggtcttcctgctgctgaccaccctctt 

GTGCCTTTGTCACCAACCAGCGCACGCATGAACAGATGGGCCCCAGCATCGAGGCCATGCCTGA^ 

GCCTCTGGGGCCTGGTCTCTGATGTCCCCCAAGAGCTGCAGGCCGTGGCACAGCAATTCTCCCTGCCCCAGGAGC 

AAGTCTCAGAGGAGCTGGATGGTGTTGGTGTGAGCATTGGGAGCGCGATCCACACTCAGCTCAGGAGCTC^ 

A^CCCTTGCTGGCGGCCGTGGGCAGTTTGGGCCAGGTCCTGCAGGTCTCCGTGCACCACCTGCAA^ 

CTACAGTGGTAGAGCTGCAGGCCGGGCAGCAGGACCTGGAGCCAGCCATCCGGGAACACCGGGACCGCCTC^ 

StgctgcaIaggccaggtgccagggagattgtgcaggggccctgagctgggcccgcaccc^ 

CTGACTTCAGCCAGGTGCCCTCTGTGGACCATGTCCTGCACCAGCTAAAAGGTGTCCCCGAGGCCAACTTCTCCA 

Satggtccaggaggagaacagcacc^ 

SIgSgaagaaggcagIggcccagcagccggaaggggtgagga^ 

c^gcttcccgctcggcccaggcactgcaggaggtggaggagagcagccgcccctacctgcaggaggtg^ 

acgagacctacaggtggatcgtgggctgcgtgctgtgctccgtggtcctattcgtggtgctctgcaacctg^ 

gcctcaatctgggcatctggggcctgtctgccagggacgaccccag 

cccgotcctcatggcaggtgtgggcctcagcttcctctttgct 

ccccagggaacctgcccccgtccatgaacctgtcgcaacttcttggcctgaggaagaacatcagcatcc 
ccSSSSaaggaagggg^^ 

ACCTGGATATCAACCAGTATACCAACAAGCTACGGCAGGAGT^ 

ACCTGCTGAGCrcAGCCGCCCGCCGGGACCTGGAGGCCCTGCAGAGCAGTGGGCTTCAGCGCATCCACTA^ 

aStcctcg?tcagatccagaggcccgtggtgaagaccagcatggagcagctggccca^ 

CCCAGGCCCAAGACAATTCTGTGCTGGGGCAGCGGCTGCAGGAGGAGGCCCAAGGACTCAGAAACCTTCACCAGG 

agSgctcgtcSccIgcagag^ 

ATMCCAGCTGGAGACCTCAGATGTCCTAGCCAATGTCACCTACCTGAAAGGAGAGCTGCCTGCCTGGGCAGCCA 

ggScc?gaggaa?gtg^ 

Sgaggtga^tcagcgcattgccacctgccagcccctctccggagccctggacaacagccgtgtgat^ 

gtgacatgatggctgacccctggaatgccttctggttctgcctggcatggtgcaccttcttcctgatcccc^ 

tcatctttgccgtcaagacctccaaatacttccgtcctatccggaaacgcctcagctccaccagctctga 

ctcagctcttccacatcccccgggttacctccctgaagctg^gggccttgtggggtgaggtgaccctgaggctg 

cctgtcctcccctttgatttagcctgggccacaggacttcggtagctcttgccccagagcccaggctggcatcca 

gSSggSSccccagtt^ 
cc^catcattcacgctcagaatcacatg^ 

cgSacc?ccatgctggtggcatcctcacaggaagagcctgttctccacctgctggagcctggaccc 

rACAGAGGCCTCGTCCAACCCCACTCCCCTTCCCGTGTGTCTTCCCCCTGCCAAGCCTCCCCCTGCCAAGCCTCC 

cSSccSSSgag^ccctc^ 

atgScttcctggccctttgcttcctcccttagtcccctcttcaccatatctccactgctaccttgctggccc^ 
JIgaccaccctgcccaaccaaaccactcag 

ctggctgcaggccctgcctcatggcctctgagccctccactgccccagggccttgggccctctgcag^^ 

?agga??S?tgttgtccagtgggg 
a^tgggggSccagcacttagct 

g^ggcttgcccttacctcttcagaaggaagcatcttccacagcccccacccaactttcttaggagtgatctggt 

ggcSgaISggattttgcacggccccttttatcc 

tSScctgagtcS 

attaaagtgttctcacttcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
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148/550 
FIGURE 148

MKHTLALLAPLLGLGLGLALSQLAAGATDCKFLGPAEHLTFTPAARARWLAPRVRAPGLL 

DSLYGTVRRFLSVVQLNPFPSELVKALLNELASVKVNEVVRYEAGYVVCAVIAGLYLLLV

PTAGLCFCCCRCHRRCGGRVKTEHKALACERAALMVFLLLTTLLLLIGVV^ 

EQMGPSIEAMPETLLSLWGLVSDVPQELQAVAQQFSLPQEQVSEELDGVGVSIGSAIHTQ

LRSSVYPLLAAVGSLGQVLQVSVHHLQTLNATVVELQAGQQDLEPAIREHRDRLLELLQE

ar?qgdcag^swart£elgadfsqvpsvdhvlhqlkgvpea N fssmv^ 

LAAMOTSSWQELKKAVAQQPEGVRTLAEGFPGLEAASRWAQALQEVEESSRPYLQEVQR 
YETYRWIVGCVLCSVVLFVVLCNLLGLNLGIWGLSARDDPSHPEAKGEAGARTLMAGVGL
SFLFAAPLILLVFATFLVGGNVQTLVCRSWENGELFEFADTPGNLPPSMNLSQLLGLRKN
ISIHQAYQQCKEGAALWTVLQLNDSYDLEEHLDINQYTNKLRQELQSLKVDTQSLDLLSS
AARRDLEALQSSGLQRIHYPDFLVQIQRPVVKTSMEQLAQELQGLAQAQDNSVLGQRLQE
EAQGLRNLHQEKWPQQSLVAKLNLSVRALESSAPNLQLETSDVLANVTYLKGELPAWAA 

rSvSEC^ 

FWFCLAWCTFFLIPSIIFAVKTSKYFRPIRKRLSSTSSEETQLFHIPRVTSLKL 

Signal peptide: 

amino acids 1-17 
Transmembrane domain: 

amino acids 105-125, 153-173, 428-449, 476-500, 778-797 
N-glycosylation sites: 

amino acids 270-273, 343-347, 352-356, 530-534, 540-546, 563-567, 
684-688, 707-711, 725-729 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 811-815 

Tyrosine kinase phosphorylation site. 

amino acids 95-103
N-myristoylation sites. 

amino acids 13-19, 15-21, 17-23, 26-32, 58-64, 124-130, 168-174, 
228-234, 230-236, 320-326, 338-344, 393-399, 429-435, 446-452, 
477-483, 500-506, 536-542, 644-650, 761-767 
Phospholipase A2 histidine active site.
amino acids 129-137

4Fe-4S ferredoxins, iron-sulfur binding region signature. 

amino acids 126-138 

Mitochondrial energy transfer proteins signature. 

amino acids 80-89 
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149/550 
FIGURE 149 

CACAGCTCCCTTCCCAGGACGTGAAAATCTGCCTTCTCACCATGAGGCTTCTAGTCCTTTCCA 
GCCTGCTCTGTATCCTGCTTCTCTGCTTCTCCATCTTCTCCACAGAAGGGAAGAGGCGTCCTG 
CCAAGGCCTGGTCAGGCAGGAGAACCAGGCTCTGCTGCCACCGAGTCCCTAGCCCCAACTCAA 
CAAACCTGAAAGGACATCATGTGAGGCTCTGTAAACCATGCAAGCTTGAGCCAGAGCCCCGCC 
TTTGGGTGGTGCCTGGGGCACTCCCACAGGTGTAGCACTCCCAAAGCAAGACTCCAGACAGCG
GAGAACCTCATGCCTGGCACCTGAGGTACCCAGCAGCCTCCTGTCTCCCCTTTCAGCCTTCAC 
AGCAGTGAGCTGCAATGTTGGAGGGCTTCATCTCGGGCTGCAAGGACCCTGGGAAAGTTCCAG 
AACTCCACGTCCTTGTCTCAATTGTGCCATCAACTTTCAGAGCTATCATGAGCCAACCTCACC 
CCACAGGGCCTCAGTCGCCACCATGTGGGCCTCTCCAGTGCAAACCACCGAGCATTCCACCAT 
GACCGGTCACAGCTACAAATCCAGAGACCATCAATCCTGCTAGAGTGCAGGGTGGCAAGCACC 
CAAGGGTGGCTGACCAAGACTGCAGAGTCTCCTCCATCTTCAGGTCCATTCAGCCTCCTGGCA 
TTTAACTACCAGCATCCAGTGGTCCCCAAGGAATCCCTTCCTAGCCTCCTGACATGAGTCTGC 
TGGAAAGAGCATCCAAACAAACAAGTAATAAATAAATAAATAAACTCA
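
The start of the coding region in the Figure 149 nucleotide sequence can be compared with the Figure 150 polypeptide by conceptual translation. The following is a minimal sketch and is not part of the original figures: it assumes the standard genetic code, the helper name `translate_from_first_atg` is hypothetical, and only the first two lines of Figure 149 are used as input.

```python
# Illustrative sketch only; not taken from the original figures.
# Assumption: standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of input.

    Whitespace is ignored; codons containing characters other than
    T, C, A, G (for example N) are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    start = seq.find("ATG")
    if start < 0:
        return ""
    residues = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the open reading frame
            break
        residues.append(aa)
    return "".join(residues)


# First two lines of the Figure 149 sequence, copied from the listing.
FIG149_START = (
    "CACAGCTCCCTTCCCAGGACGTGAAAATCTGCCTTCTCACCATGAGGCTTCTAGTCCTTTCCA"
    "GCCTGCTCTGTATCCTGCTTCTCTGCTTCTCCATCTTCTCCACAGAAGGGAAGAGGCGTCCTG"
)

peptide = translate_from_first_atg(FIG149_START)
print(peptide)
```

The printed peptide corresponds to the first 28 residues of the Figure 150 sequence.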
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150/550 
FIGURE 150 

MRLLVLSSLLCILLLCFSIFSTEGKRRPAKAWSGRRTRLCCHRVPSPNSTNLKGHHVRLCKPC 
KLEPEPRLWVVPGALPQV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation site. 

amino acids 48-52 

Amidation sites.

amino acids 23-27, 33-37 
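
The motif positions listed for Figure 150 can be reproduced with a simple pattern scan. The figures do not state which tool produced the annotations, so the sketch below is an assumption: it uses the PROSITE-style consensus patterns N-{P}-[ST]-{P} for N-glycosylation and x-G-[RK]-[RK] for amidation, written as regular expressions, and the helper name `motif_starts` is hypothetical.

```python
import re

# Figure 150 polypeptide, copied from the listing.
FIG150 = (
    "MRLLVLSSLLCILLLCFSIFSTEGKRRPAKAWSGRRTRLCCHRVPSPNSTNLKGHHVRLCKPC"
    "KLEPEPRLWVVPGALPQV"
)

# Assumed consensus patterns (PROSITE-style), expressed as regexes.
MOTIFS = {
    "N-glycosylation": r"N[^P][ST][^P]",  # N-{P}-[ST]-{P}
    "Amidation": r".G[RK][RK]",           # x-G-[RK]-[RK]
}


def motif_starts(sequence: str, pattern: str) -> list:
    """Return 1-based start positions of all (possibly overlapping) matches."""
    lookahead = re.compile("(?=" + pattern + ")")
    return [m.start() + 1 for m in lookahead.finditer(sequence)]


hits = {name: motif_starts(FIG150, pat) for name, pat in MOTIFS.items()}
print(hits)
```

Under these assumed patterns the scan reports an N-glycosylation motif starting at residue 48 and amidation motifs starting at residues 23 and 33, which are the starting residues of the ranges listed above.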
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151/550 



FIGURE 151 

CACCGGAGGGCACGCAGCTGACGGAGCTGCGCTGCGTTCGCCTCGTTTGCCTCGCGCCCTCCA 

CTGGAGCTGTTCGCGCCTCCCGGCTCCCACCGCAGCCCACCCGGCAGAGGAGTCGCTACCAGC 

GCCCAGTGCGCTCTGTCAGTCCGCAAACTCCTTGCCGCCCGCCCCGGGCTGGGCACCAAATAC 

CAGGCTACCATGGTCTACAAGACTCTCTTCGCTCTTTGCATCTTAACTGCAGGATGGAGGGTA 

CAGAGTCTGCCTACATCAGCTCCTTTGTCTGTTTCTCTTCCGACAAACATTGTACCACCGACC 

ACCATCTGGACTAGCTCTCCACAAAACACTGATGCAGACACTGCCTCCCCATCCAACGGCACT 

CACAACAACTCGGTGCTCCCAGTTACAGCATCAGCCCCAACATCTCTGCTTCCTAAGAACATT 

TCCATAGAGTCCAGAGAAGAGGAGATCACCAGCCCAGGTTCGAATTGGGAAGGCACAAACACA 

GACCCCTCACCTTCTGGGTTCTCGTCAACAAGCGGTGGAGTCCACTTAACAACCACGTTGGAG 

GAACACAGCTCGGGCACTCCTGAAGCAGGCGTGGCAGCTACACTGTCGCAGTCCGCTGCTGAG 

CCTCCCACACTCATCTCCCCTCAAGCTCCAGCCTCATCACCCTCATCCCTATCAACCTCACCA 

CCTGAGGTCTTTTCTGCCTCCGTTACTACCAACCATAGCTCCACTGTGACCAGCACCCAACCC 

ACTGGAGCTCCAACTGCACCAGAGTCCCCGACAGAGGAGTCCAGCTCTGACCACACACCCACT 

TCACATGCCACAGCTGAGCCAGTGCCCCAGGAGAAAACACCCCCAACAACTGTGTCAGGCAAA 

GTGATGTGTGAGCTCATAGACATGGAGACCACCACCACCTTTCCCAGGGTGATCATGCAGGAA 

GTAGAACATGCATTAAGTTCAGGCAGCATCGCCGCCATTACCGTGACAGTCATTGCCGTGGTG 

CTGCTGGTGTTTGGAGTTGCAGCCTACCTAAAAATCAGGCATTCCTCCTATGGAAGACTTTTG 

GACGACCATGACTACGGGTCCTGGGGAAACTACAACAACCCTCTGTACGATGACTCCTAACAA 

TGGAATATGGCCTGGGATGAGGATTAACTGTTCTTTATTTATAAGTGCTTATCCAGTAGAATT 

AATAAGTACCTGATGCGCATTGAACGACAATCTTAAGCCCTGTTTTGTTGGTATGGTTGTTTT 

TGTTTTCCTCCCTCTCCTCTGGCTGCTACAACTTCCCCTTTCTGGTACAAGAAGAACCATTCT 

TTAAAGGTGAGTGGAGGCTGATTTGCAGCTGAAGTGGGCCAGCCTTGCACCAGCCAGGCCAGA 

CCACCATGGTGAAGGCTTCTTTCCCCACTGCAGGACCCACTTTGAGAAGGATCGAGGAGGAGG 

ATTTGGGTTGTTTTGTTAGGGGTTACTTTCAGGGGAACATTTCATTTGTGTTATTTCTTAAAC 

TTCTATTTAGGAAATTACATTAAGTATTAATGAGGGGAAAGGAAATGAGCTCTACGAGGATTT 

CACCTTGCATGGGAGAGAGCAGGGTTTTCTCAGATTCCTTTTTAATCTCTATTTATCTGGTTG 

TTTCTGACAGGATGCTGCCTGCTTGGCTCTACGAGCTGGAAAGCAGCTTCTTAGCTGCCTAAT 

TAATGAAAGATGAAAATAGGAAGTGCCCTGGAGGGGGCCAGCAGGTCACGGGGCAGAATCTCT 

CAGGTTGCTGTGGGATCTCAGTGTGCCCCTACCTGTTCTCCCCTCCAGGCCACCTGTCTCTGT 

AAAGGATGTCTGCTCTGTTCAAAAGGCAGCTGGGATCCCAGCCCACAAGTGATCAGCAGAGTT 

GCATTTCCAAAGAAAAAGGCTATGAGATGAGCTGAGTTATAGAGAGAAAGGGAGAGGCATGTA 

CGGTGTGGGGAAGTGGAAGAGAAGCTGGCGGGGGAGAAGGAGGCTAACCTGCACTGAGTACTT 

CATTAGGACAAGTGAGAATCAGCTATTGATAATGGCCAGAGATATCCACAGCTTGGAGGAGCC 

CAGAGACTGTTTGCTTTATACCCACACAGCAACTGGTCCACTGCTTTACTGTCTGTTGGATAA 

TGGCTGTAAAATGTTTAAAAAC
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152/550 

FIGURE 152 

MVYKTLFALCILTAGWRVQSLPTSAPLSVSLPTNIVPPTTIWTSSPQNTDADTASPSNGTHNN 
SVLPVTASAPTSLLPKNISIESREEEITSPGSNWEGTNTDPSPSGFSSTSGGVHLTTTLEEHS 
SGTPEAGVAATLSQSAAEPPTLISPQAPASSPSSLSTSPPEVFSASVTTNHSSTVTSTQPTGA 
PTAPESPTEESSSDHTPTSHATAEPVPQEKTPPTTVSGKVMCELIDMETTTTFPRVIMQEVEH 
ALSSGSIAAITVTVIAVVLLVFGVAAYLKIRHSSYGRLLDDHDYGSWGNYNNPLYDDS

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 258-278 

N-glycosylation sites.

amino acids 58-61, 62-65, 80-83, 176-179 

Casein kinase II phosphorylation sites. 

amino acids 49-52, 85-88, 95-98, 100-103, 120-123, 121-124, 141-144,
164-167, 191-194, 195-198, 200-203

Tyrosine kinase phosphorylation site. 

amino acids 289-296 



N-myristoylation sites.

amino acids 59-64, 115-120, 128-133, 133-138, 257-262, 297- 
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153/550 

FIGURE 153 

ACGTCACTGTCTTGAAGCAGCAGTAGCCTGGGAAGTGAGGCAGGAGGAATTGAGAGGCAGGAA 
GGGNGCTGGAGACACAGCTGAGCCTGGAAATGAGAGTGGGCATCGCCGTGGTCATCATGACTC 
CTCTGCGGCGTGGTCACCATGTTGGTTCACTGTGTTGGGCTCTTATTGACGGGTCTCCTGCTA 
GGCCTGACCTTGGGTGCCGGAGCCCTGCTGGCTTCTGAGCCTATCTACCAACCACCTTCAGCC 
TGGGTGCCAGCTGGGGGGCTGGTGGGGCTGGCGCTGCTGGGAGCCCTGCTCACACTTCGGTGG 
CCACGTCCATTCACAGTTCTGGGCACAACCCTGCTGGGTTCTGCAGTGCTTGTGGCCTGTGTT 
GACTACTTCCTGGAGGGGCTGGCACTGGGGAGTTGGCTGGGCCAACGCCTGCAGACACTTCCA 
GCCTTGCCTTCTCTCTGCTGATATAGCTGGGTCTTACTGGGGATCTGGCCAGCCTTGGGGGCC 
CTTGGAGCCCTGGCCCAGTGGi\AGCTCGTGCCTGAGGAACATGGAGGCCACGCTAATGGGTCT 
GTTCCTGGTTTCCCAGATGCATAAAGGAAGACATATCCCTCCCCTGGGCAGCAAGGCTACAAT 
GGGAGGGAGGGAGAACATGGGAGCATGTGAATAAAATGGCATTAAATACTGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA7VAAA 
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154/550 

FIGURE 154 

MLVHCVGLLLTGLLLGLTLGAGALLASEPIYQPPSAWVPAGGLVGLALLGALLTLRWPRPFTV 
LGTTLLGSAVLVACVDYFLEGLALGSWLGQRLQTLPALPSLC 

Signal peptide: 

amino acids 1-20 

Transmembrane domain : 

amino acids 38-55, 60-78 

N-myristoylation sites. 

amino acids 7-13, 12-18, 16-22, 22-28, 41-47, 50-56, 84-90, 88-94 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 67-78 
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155/550 

FIGURE 155

TGCAATTAAAGGAGTCGGGTCTCTAACTGTTGATCTGTTTTTTTCCCTTCTGAGCAATGGAGC 
TTACCATCTTTATCCTGAGACTGGCCATTTACATCCTGACATTTCCCTTGTACCTGCTGAACT 
TTCTGGGCTTGTGGAGCTGGATATGCAAAAAATGGTTCCCCTACTTCTTGGTGAGGTTCACTG 
TGATATACAACGAACAGATGGCAAGCAAGAAGCGGGAGCTCTTCAGTAACCTGCAGGAGTTTG 
CGGGCCCCTCCGGGAAACTCTCCCTGCTGGAAGTGGGCTGTGGCACGGGGGCCAACTTCAAGT 
TCTACCCACCTGGGTGCAGGGTGACCTGTATTGACCCCAACCCCAACTTTGAGAAGTTTTTGA 
TCAAGAGCATTGCAGAGAACCGACACCTGCAGTTTGAGCGCTTTGTGGTAGCTGCCGGGGAGA 
ACATGCACCAGGTGGCTGATGGCTCTGTGGATGTGGTGGTCTGCACCCTGGTGCTGTGCTCTG 
TGAAGAACCAGGAGCGGATTCTCCGCGAGGTGTGCAGAGTGCTGAGACCGGGAGGGGCTTTCT 
ATTTCATGGAGCATGTGGCAGCTGAGTGTTCGACTTGGAATTACTTCTGGCAACAAGTCCTGG 
ATCCTGCCTGGCACCTTCTGTTTGATGGGTGCAACCTGACCAGAGAGAGCTGGAAGGCCCTGG 
AGCGGGCCAGCTTCTCTAAGCTGAAGCTGCAGCACATCCAGGCCCCACTGTCCTGGGAGTTGG 
TGCGCCCTCATATCTATGGATATGCTGTGAAATAGTGTGAGCTGGCAGTTAAGAGCTGAATGG 
CTCAAAGAATTTAAAGCTTCAGTTTTACATTTAAAATGCTAAGTGGGAGAAGAGAAACCTTTT 
TTTTGGGGGGCGGTTTTTTTGGTTTGTTGTTGGTTTTTTTTTTTTTTTTGGCAGGAGAATCTC 
TTGAACCCAGAAGGCGAAGGTTGCAGTGAACCGAGATCATGCCATTGTACTCTAGCCTGGGTG 
ACAAGAGCAAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAGAAGTAGAGACAGGGAGAC
GGGGTCTCACTGTGTTGCCTAGGCCGGTCTTGAACTCCTGGGCTCAAGTGATTCTCCCACCTT 
GACCTCCTAAATTGTTGGGATTACAGGTGTGAGACAGTGCACCTGGCCGAAATAGCTCAAGTT 
TCTGAAAAACAAATCTGAATCTATTTGTTATTCTTAGCGTCACTGGTCTGGCTTTCAGAATTA 
ACATACAAGGTTGCCACACCTAGTTCTGCCGAGCTTTATGTCTTTTATTCCAGTATTCCACCA 
AAGTTTGTTTTCCTGCATTCCAGTTCTCAAGTCTTAAGATAAAGATTGTACTTGACAGTTTAG 
TATATCCATAAAACTATTTGAGGTGGTTAAGGTTCTTGGGTTCATTTTCCTTAATACTTTGCT 
GAATATTGTAGATTGTAGGCAATGAAAAAGTCTACTAAATTAGGAAAACCTTGAATAATTAGG 
TATCCTAGGTAAGAGCCCCTAAACATCAAGCAATCTGTGAGTCTGTAAAGAAATAAATATTTT 
TTGGATTATTCTTATCTAATTCCACCCCTGTTGGAAGATGATTTCTTTGTTCTTTGCAACTAT
GGAAGCTGTGAAAATCATCACAAGTGCCTCTGAAAGCGAGTGTTAGGTTGGTTAGAGGGTTTA 
ATATTTTCTGCAATGGTTTGTAGGAATTTTAATAAATGTAGTATATTTTCTGAGATGATTTTG 
TAAAAGTACTATTTTAAATATCAAATCAACCAATAAATTCACATTTGTGTTAGGAACAAAA 
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156/550 
FIGURE 156 

MELTIFILRLAIYILTFPLYLLNFLGLWSWICKKWFPYFLVRFTVIYNEQMASKKRELFSNLQ
EFAGPSGKLSLLEVGCGTGANFKFYPPGCRVTCIDPNPNFEKFLIKSIAENRHLQFERFVVAA
GENMHQVADGSVDVVVCTLVLCSVKNQERILREVCRVLRPGGAFYFMEHVAAECSTWNYFWQQ
VLDPAWHLLFDGCNLTRESWKALERASFSKLKLQHIQAPLSWELVRPHIYGYAVK

Signal peptide: 

amino acids 1-29 

N-glycosylation site. 

amino acids 203-207 

N-myristoylation sites. 

amino acids 78-84, 80-86, 91-97, 201-207
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157/550 
FIGURE 157

CCGCTGAGATGTACGAACTTCCGGTTCTCCGGGCAGCTGCCACTGCTGTAGCTTCTGCCACCT 

GCCACGACCGGGCCTCTCCCTGGCGTTTGGTCACCTCTGCTTCATTCTCCACCGCGCCTATGG 

TCCCTCTTGGAGCCAGCGTGGCGGGCCTGGCGGCTCCCGGGTGGTGAGAGAGCGGTCCGGGAA 

CGATGAAGGCCTCGCAGTGCTGCTGCTGTCTCAGCCACCTCTTGGCTTCCGTCCTCCTCCTGC 

TGTTGCTGCCTGAACTAAGCGGGCCCCTGGCAGTCCTGCTGCAGGCAGCCGAGGCCGCGCCAG 

GTCTTGGGCCTCCTGACCCTAGACCACGGACATTACCGCCGCTGCCACCGGGCCCTACCCCTG 

CCCAGCAGCCGGGCCGTGGTCTGGCTGAAGCTGCGGGGCCGCGGGGCTCCGAGGGAGGCAATG 

GCAGCAACCCTGTGGCCGGGCTTGAGACGGACGATCACGGAGGGAAGGCCGGGGAAGGCTCGG 

TGGGTGGCGGCCTTGCTGTGAGCCCCAACCCTGGCGACAAGCCCATGACCCAGCGGGCCCTGA 

CCGTGTTGATGGTGGTGAGCGGCGCGGTGCTGGTGTACTTCGTGGTCAGGACGGTCAGGATGA 

GAAGAAGAAACCGAAAGACTAGGAGATATGGAGTTTTGGACACTAACATAGAAAATATGGAAT 

TGACACCTTTAGAACAGGATGATGAGGATGATGACAACACGTTGTTTGATGCCAATCATCCTC 

GAAGATAAGAATGTGCCTTTTGATGAAAGAACTTTATCTTTCTACAATGAAGAGTGGAATTTC 

TATGTTTAAGGAATAAGAAGCCACTATATCAATGTTGGGGGGGTATTTAAGTTACATATATTT 

TAACAACCTTTAATTTGCTGTTGCAATAAATACCGTATCCTTTTATTATATCTTTATATGTAT 

AGAAGTACTCTATTAATGGGCTCAGAGATGTTGGGGATAAAGTATACTGTAATAATTTATCTG 

TTTGAAAATTACTATAAAACGGTGTTTTCTGGTCGGTTTTTGTTTCCTGCTTACCATATGATT 

GTAAATTGTTTTATGTATTAATCAGTTAATGCTAATTATTTTTGCTGATGTCATATGTTAAAG 

AGCTATAAATTCCAACAACCAACTGGTGTGTAAAAATAATTTAAAATTTCCTTTACTGAAAGG 

TATTTCCCATTTTTGTGGGGAAAAGAAGCCAAATTTATTACTTTGTGTTGGGGTTTTTAAAAT 

ATTAAGAAATGTCTAAGTTATTGTTTGCAAAACAATAAATATGATTTTAAATTCTCTTAAAAA 

AAAAA 



WO 01/40466

PCT/US00/32678

FIGURE 158 

MKASQCCCCLSHLLASVLLLLLLPELSGPLAVLLQAAEAAPGLGPPDPRPRTLPPLPPGPTPA 
QQPGRGLAEAAGPRGSEGGNGSNPVAGLETDDHGGKAGEGSVGGGLAVSPNPGDKPMTQRALT 
VLMVVSGAVLVYFVVRTVRMRRRNRKTRRYGVLDTNIENMELTPLEQDDEDDDNTLFDANHPRR

Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 124-140 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 69-75, 78-84, 81-87, 97-103, 103-109, 106-112, 
157-160 

FIGURE 159

GCTGCAGGCGGCGACGGCTACACCATGGGCCGGCTGCTGCGGGCCGCCCGGCTGCCGCCGCTG 
CTTTCGCCGCTGCTGCTTCTGCTGGTTGGGGGAGCGTTCCTGGGTGCCTGTGTGGCTGGGTCT 
GATGAGCCTGGCCCAGAGGGCCTCACCTCCACCTCCCTGCTAGACCTCCTGCTGCCCACTGGC 
TTGGAGCCACTGGACTCAGAGGAGCCTAGTGAGACCATGGGCCTGGGAGCTGGGCTGGGAGCC 
TCTGGCTCAGGCTTCCCCAGCGAAGAGAATGAAGAGTCTCGGATTCTGCAGCCACCACAGTAC 
TTCTGGGAAGAGGAGGAAGAGCTGAATGACTCAAGTCTGGACCTGGGACCCACTGCAGATTAT 
GTTTTTCCTGACTTAACTGAGAAGGCAGGTTCCATTGAAGACACTAGCCAGGCTCAAGAGCTG 
CCAAACCTCCCCTCTCCCTTGCCCAAGATGAATCTGGTTGAGCCTCCCTGGCATATGCCTCCC
AGAGAGGAGGAAGAAGAGGAAGAGGAAGAGGAGGAGAGGGAGAAGGAAGAGGTAGAGAAACAA 
GAGGAGGAGGAAGAGGAGGAGCTGCTCCCTGTGAATGGATCCCAAGAAGAAGCCAAGCCTCAG 
GTCCGTGACTTTTCTCTCACCAGCAGCAGCCAGACCCCAGGGGCCACCAAAAGCAGGCATGAA 
GACTCCGGGGACCAGGCCTCATCAGGTGTGGAGGTGGAGAGCAGCATGGGGCCCAGCTTGCTG 
CTGCCTTCAGTCACCCCAACTACAGTGACTCCGGGGGACCAGGACTCCACCAGCCAAGAGGCA 
GAGGCCACAGTGCTGCCAGCTGCAGGGCTTGGGGTAGAGTTCGAGGCTCCTCAGGAAGCAAGC 
GAGGAAGCCACTGCAGGAGCAGCTGGTTTGTCTGGCCAGCACGAGGAGGTGCCGGCCTTGCCT 
TCATTCCCTCAAACCACAGCTCCCAGTGGGGCCGAGCACCCAGATGAAGATCCCCTTGGCTCT 
AGAACCTCAGCCTCTTCCCCACTGGCCCCTGGAGACATGGAACTGACACCTTCCTCTGCTACC 

ATACCCTGGGATTCTACGCAGGTGATCTGCAAGGACTGGAGCAATCTGGCTGGGAAAAACTAC 
ATCATTCTGAACATGACAGAGAACATAGACTGTGAGGTGTTCCGGCAGCACCGGGGGCCACAG 
CTCCTGGCCCTGGTGGAAGAGGTGCTGCCCCGCCATGGCAGTGGCCACCATGGGGCCTGGCAC 

a?S??c5gag?aagcccagcgagaaggagcagca^ 

GGGGTGGTGCCCACTCAAGATGTCCTTTCCATGCTGGGTGACATCCGCAGGAGCCTGGAGGAG 
ATTGGCATCCAGAACTATTCCACAACCAGCAGCTGCCAGGCGCGGGCCAGCCAGGTGCGCAGC 

gSacggSc^ 

CTTGGCCTGCTCTACAACTGCTGGCAGCGCCGGCTGCCCAAGCTCAAGCACGTGTCGCACGGC 

gaggagSgcgcttcgtggagaa 

AGCCAGTCGGAGATGCAGGAGAAGCACCCCAGCCTGAACGGCGGCGGGGCCCTCAACGGCCCG 

gggaggtggLggcgctcatggggggcaagcgggaccccgaggactcggacg 

GACACGCACCTGTGAGCGCAGCCGAGGCGCAGGCCGAGTGGGCCGCCAGGACCAAGCGAGGTG 
GACCCCGAAACGGACGGCCCGGAGCCCGCACCAGCCCCGCGCCTACCCGGGCCGCCCCCGCGG 

Stggccc^ggcgcgggctccttcccgcttcccccgacttcacacggcggcttcggaccaac 

TCCCTCACTC^CGCCCGAGGGGCAGGCCTCAAAGCCCGCCTTGGCCCCGCTTTCCCGCCCCTG 
AACCCCGGCCCCGCGGGCGGCGGGCGGCGCTTCCTGCGCCCCGGGACTCAATTAAACCCGCCC 

GGAGACCACGCCGGGCCCAGCAAAA 
FIGURE 160 

MGRLLRAARLPPLLSPLLLLLVGGAFLGACVAGSDEPGPEGLTSTSLLDLLLPTGLEPLDSEE 
PSETMGLGAGLGASGSGFPSEENEESRILQPPQYFWEEEEELNDSSLDLGPTADYVFPDLTEK 
AGSIEDTSQAQELPNLPSPLPKMNLVEPPWHMPPREEEEEEEEEEEREKEEVEKQEEEEEEEL 
LPVNGSQEEAKPQVRDFSLTSSSQTPGATKSRHEDSGDQASSGVEVESSMGPSLLLPSVTPTT 
VTPGDQDSTSQEAEATVLPAAGLGVEFEAPQEASEEATAGAAGLSGQHEEVPALPSFPQTTAP 
SGAEHPDEDPLGSRTSASSPLAPGDMELTPSSATLGQEDLNQQLLEGQAAEAQSRIPWDSTQV 
ICKDWSNLAGKNYIILNMTENIDCEVFRQHRGPQLLALVEEVLPRHGSGHHGAWHISLSKPSE 
KEQHLLMTLVGEQGVVPTQDVLSMLGDIRRSLEEIGIQNYSTTSSCQARASQVRSDYGTLFW
LWIGAICIIIIALGLLYNCWQRRLPKLKHVSHGEELRFVENGCHDNPTLDVASDSQSEMQEK 

HPSLNGGGALNGPGSWGALMGGKRDPEDSDVFEEDTHL 

Signal peptide: 

amino acids 1-29 

Transmembrane domain:

amino acids 499-521 

N-glycosylation sites. 

amino acids 106-110, 193-197, 395-399, 480-484 

Glycosaminoglycan attachment site.

amino acids 77-81 

N-myristoylation sites. 

amino acids 24-30, 28-34, 41-47, 69-75, 71-77, 73-79, 75-81, 
216-222, 327-333, 455-461, 519-525, 574-580, 581-587, 584-590 



Amidation site. 

amino acids 588-592 
FIGURE 161 

CCAGGGCGGAGCGCAGCTGCGCCGGGCTTGGGCGCCTGGGGCCGCCGCTCCCCACCGTCGTTT 

TCCCCACCGAGGCCGAGGCGTCCCGGAGTCATGGCCGGCCTGAACTGCGGGGTCTCTATCGCA 

CTGCTAGGGGTTCTGCTGCTGGGTGCGGCGCGCCTGCCGCGCGGGGCAGAAGCTTTTGAGATT 

GCTCTGCCACGAGAAAGCAACATTACAGTTCTCATAAAGCTGGGGACCCCGACTCTGCTGGCA 

AAACCCTGTTACATCGTCATTTCTAAAAGACATATAACCATGTTGTCCATCAAGTCTGGAGAA 

AGAATAGTCTTTACCTTTAGCTGCCAGAGTCCTGAGAATCACTTTGTCATAGAGATCCAGAAA 

AATATTGACTGTATGTCAGGCCCATGTCCTTTTGGGGAGGTTCAGCTTCAGCCCTCGACATCG 

TTGTTGCCTACCCTCAACAGAACTTTCATCTGGGATGTCAAAGCTCATAAGAGCATCGGTTTA 

GAGCTGCAGTTTTCCATCCCTCGCCTGAGGCAGATCGGTCCGGGTGAGAGCTGCCCAGACGGA 

GTCACTCACTCCATCAGCGGCCGAATCGATGCCACCGTGGTCAGGATCGGAACCTTCTGCAGC 

AATGGCACTGTGTCCCGGATCAAGATGCAAGAAGGAGTGAAAATGGCCTTACACCTCCCATGG 

TTCCACCCCAGAAATGTCTCCGGCTTCAGCATTGCAAACCGCTCATCTATAAAACGTCTGTGC 

ATCATCGAGTCTGTGTTTGAGGGTGAAGGCTCAGCAACCCTGATGTCTGCCAACTACCCAGAA 

GGCTTCCCTGAGGATGAGCTCATGACGTGGCAGTTTGTCGTTCCTGCACACCTGCGGGCCAGC 

GTCTCCTTCCTCAACTTCAACCTCTCCAACTGTGAGAGGAAGGAGGAGCGGGTTGAATACTAC 

ATCCCGGGCTCCACCACCAACCCCGAGGTGTTCAAGCTGGAGGACAAGCAGCCTGGGAACATG 

GCGGGGAACTTCAACCTCTCTCTGCAAGGCTGTGACCAAGATGCCCAAAGTCCAGGGATCCTC 

CGGCTGCAGTTCCAAGTTTTGGTCCAACATCCACAAAATGAAAGCAGTGAGTGAGCCCCACTT 

TCCTTTTTCTTCCTCCTCCAGCACCTTCGTTGTTTCCTGGGTAGTCTGCCTGGGTGAGGCTCC 

CTTCCTGTTTCTCATCTGTGGCTTCTGAAACACTTAGACTCTGGACCCAGCAAGAGTTTCAGG 

AAGTGGGTTGCTAGGCAGTTAGACAGGCTTGTTGGTGAACACCCGGTATGTAGTTCCATTTCA 

GCACAATAAAAAGAAATCTTGCATTCAAGATGCTAAATTGTTTTTAACGAAAA 
FIGURE 162 

MAGLNCGVSIALLGVLLLGAARLPRGAEAFEIALPRESNITVLIKLGTPTLLAKPCYIVISKR 
HITMLSIKSGERIVFTFSCQSPENHFVIEIQKNIDCMSGPCPFGEVQLQPSTSLLPTLNRTFI 
WDVKAHKSIGLELQFSIPRLRQIGPGESCPDGVTHSISGRIDATVVRIGTFCSNGTVSRIKMQ
EGVKMALHLPWFHPRNVSGFSIANRSSIKRLCIIESVFEGEGSATLMSANYPEGFPEDELMTW 
QFVVPAHLRASVSFLNFNLSNCERKEERVEYYIPGSTTNPEVFKLEDKQPGNMAGNFNLSLQG

CDQDAQSPGILRLQFQVLVQHPQNESSE

Signal peptide: 

amino acids 1-29 

N-glycosylation sites. 

amino acids 39-43, 122-126, 180-184, 205-209, 213-217, 270-274,
310-314, 339-343 

Tyrosine kinase phosphorylation site. 

amino acids 276-284 

N-myristoylation sites.

amino acids 3-9, 7-13, 158-164, 175-181, 191-197, 303-309 

FIGURE 163

CAACCACACACCTGGGGAATTGCTGGCCTGACTTCTGACCCCTGACTCCTCATACCCTTCCTC 
CAGAGCATGACATTTGACCACCAACTGAAACCTGACCTCTGACCCCAGACCACTGGCCCTTCC 
CCCGCCCTGTGGTGACTTCATAAAGGTTACTAGCTTCTCCCCTGGCCTTGAGACCCACACGAT 
GGCCCTGCTGGCTCTGGCCAGTGCCGTCCCCTCTGCCCTGCTGGCCCTGGCTGTCTTCAGGGT
GCCCGCCTGGGCCTGTCTCCTCTGCTTCACAACCTACTCTGAGCGCCTCCGCATCTGCCAGAT 
GTTTGTTGGGATGCGGAGCCCCAAGCTTGAAGAGTGTGAGGAGGCCTTCACGGCCGCCTTCCA 
GGGCCTCTCTGACACCGAAATCAGTGAGGAGACCATCCACACTTCATCAGTGTCCTGGGGAAG 
GTGCAGAGGGAGGGCAGGAGAGGCCCAGAGGGTCAGGCTGAGGGACAGACAGAGAGAAACAGT 
CAGAGGAGAAAGGCTCAAAGACCATGAGAACAACAGAGACTTAGGGACAGAGAGACACAGACA 
GGGGAAGACAGCAGGGCAAAGACTCAGAGAGGGGAGGATGGAGAGTCAGAGAGGGGAAGATGG 
AGACTCAGAGAGAGGGGAGGATGGAGACTCAGAGAGAGAGGAAGATGGAGACTCAGAGGGAAA 
GATGGAGACTCAGGAGTATGGAGAGTCAGAGAGGGGAGGATGGACACTCAGGGGAGGATGGAG 
AGTCAGGAGGATGGAGACTCATAGAAAGGGGAGGATGGAGAGTCAGGAGAGGTTGGAGACTGG 
AGAGGGAATAGAGACCCAGAAAGGGGAGGATGGAGACTCAGAGGGTGGAAGATGGAGACTCAA 
AGAGGATGGAAACCCAGAGAGAGGAGGACAGAGATGAGGCAGAGACTAGGGGAAGCAGGATAG 
CGACTGGTCGGGGGCAGAGACTCAGGGAGGATAGAGACTCACAGAGAGGTGAGGATAGAGACT 
TGGGAGGGACTCAGGAAGCATAGCGACTGTGGGGCAAAGAGTCAGAGAGGGGAGGATACAGAC 
TTGGGAGGGCAGAGACTCAGAAACAGAATGTTCGCATTAGGGACATGGTGTTGCGGGGAGCTG 
CCTCCCCCAGCCCCTGCTCCCTCCCTCACCGCCAGACTATGATGAGAGAAGCCACCTGCATGA
CACCTTCACCCAGATGACCCATGCCCTGCAGGAGCTGGCTGCTGCCCAGGGATCCTTTGAGGT 
TGCCTTCCCTGATGCTGCAGAGAAAATGAAGAAGGTCATTACACAGCTTAAAGAAGCCCAGGC 
TTGCATCCCTCCCTGCGGTCTCCAGGAGTTCGCCCGGCGTTTCCTCTGCAGCGGGTGCTACTC 
TAGGGTCTGCGACCTCCCGCTGGACTGCCCAGTTCAGGATGTGACAGTGACTCGGGGCGACCA 
GGCTATGTTTTCTTGCATCGTAAACTTCCAGCTGCCAAAGGAGGAGATCACCTATTCCTGGAA 
GTTCGCAGGAGGAGGTCTCCGGACTCAGGACTTGTCCTATTTCCGAGATATGCCGCGGGCCGA 
AGGATACCTGGCGCGGATCCGGCCGGCTCAGCTCACGCACCGCGGGACGTTCTCCTGCGTGAT 
CAAGCAAGACCAGCGCCCCCTGGCCCGGCTCTACTTCTTTCTTAACGTCCTCGGGGCCCTCGC 
ATCAGCGAGTGCGACAGTGTTGGCGTGGTGAGTTCTGGGGACTCCGGAGCCCCAGCATCTAGC 
TCCCCGCTGTCTCAGATCCCACCGAGAAGTCTGGGTTCCCAGCAACCTCCAACCCAGGAGGAT 
GTTCTTTCGATGGTACTGCAGTGGCAACTAACAAAGGTATCTTTCCTCCTTCCCTATCCTATT 
TCCATCCTGAAAATAAAGAATATATTTCAACTCTAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAA 

FIGURE 164 

MALLALASAVPSALLALAVFRVPAWACLLCFTTYSERLRICQMFVGMRSPKLEECEEAFTAAF 

QGLSDTEISEETIHTSSVSWGRCRGRAGEAQRVRLRDRQRETVRGERLKDHENNRDLGTERHR
QGKTAGQRLREGRMESQRGEDGDSERGEDGDSEREEDGDSEGKMETQEYGESERGGWTLRGGW 

RVRRMETHRKGRMESQERLETGEGIETQKGEDGDSEGGRWRLKEDGNPERGGQR 

Signal peptide: 

amino acids 1-26 

N-myristoylation site. 

amino acids 65-71 
FIGURE 165

CAGAATCGCAGATTGCCAGCCCTTTTCCCGACCCCTACGGAAAGACGAGTCCAGGGGCCGTCC 

TGGCGAGGTCAAAACATTTAGTCTGGTCTTTTCAGCGTGGACCCTGCCAGCAGCCAGGCCATG

GAGCTCTCTGATGTCACCCTCATTGAGGGTGTGGGTAATGAGGTGATGGTGGTGGCAGGTGTG 

GTGGTGCTGATTCTAGCCTTGGTCCTAGCTTGGCTCTCTACCTACGTAGCAGACAGCGGTAGC 

AACCAGCTCCTGGGCGCTATTGTGTCAGCAGGCGACACATCCGTCCTCCACCTGGGGCATGTG 

GACCACCTGGTGGCAGGCCAAGGCAACCCCGAGCCAACTGAACTCCCCCATCCATCAGAGGGT 

AATGATGAGAAGGCTGAAGAGGCGGGTGAAGGTCGGGGAGACTCCACTGGGGAGGCTGGAGCT 

GGGGGTGGTGTTGAGCCCAGCCTTGAGCATCTCCTTGACATCCAAGGCCTGCCCAAAAGACAA 

GCAGGTGCAGGCAGCAGCAGTCCAGAGGCCCCCCTGAGATCTGAGGATAGCACCTGCCTCCCT 

CCCAGCCCTGGCCTCATCACTGTGCGGCTCAAATTCCTCAATGATACCGAGGAGCTGGCTGTG 

GCTAGGCCAGAGGATACCGTGGGTGCCCTGAAGAGCAAATACTTCCCTGGACAAGAAAGCCAG 

ATGAAACTGATCTACCAGGGCCGCCTGCTACAAGACCCAGCCCGCACACTGCGTTCTCTGAAC 

ATTACCGACAACTGTGTGATTCACTGCCACCGCTCACCCCCAGGGTCAGCTGTTCCAGGCCCC 

TCAGCCTCCTTGGCCCCCTCGGCCACTGAGCGACCCAGCCTTGGTGTCAATGTGGGCAGCCTC 

ATGGTGCCTGTCTTTGTGGTGCTGTTGGGTGTGGTCTGGTACTTCCGAATCAATTACCGCCAA 

TTCTTCACAGCACCTGCCACTGTCTCCCTGGTGGGAGTCACCGTCTTCTTCAGCTTCCTAGTA 

TTTGGGATGTATGGACGATAAGGACATAGGAAGAAAATGAAAGGCATGGTCTTTCTCCTTTAT 

GGCCTCCCCACTTTTCCTGGCCAGAGCTGGGCCCAAGGGCCGGGGAGGGAGGGGTGGAAAGGA 

TGTGATGGAAATCTCCTCCATAGGACACAGGAGGCAAGTATGCGGCCTCCCCTTCTCATCCAC 

AGGAGTACAGATGTCCCTCCCGTGCGAGCACAACTCAGGTAGAAATGAGGATGTCATCTTCCT 

TCACTTTTAGGGTCCTCTGAAGGAGTTCAAAGCTGCTGGCCAAGCTCAGTGGGGAGCCTGGGC 

TCTGAGATTCCCTCCCACCTGTGGTTCTGACTCTTCCCAGTGTCCTGCATGTCTGCCCCCAGC 

ACCCAGGGCTGCCTGCAAGGGCAGCTCAGCATGGCCCCAGCACAACTCCGTAGGGAGCCTGGA 

GTATCCTTCCATTTCTCAGCCAAATACTCATCTTTTGAGACTGAAATCACACTGGCGGGAATG 

AAGATTGTGCCAGCCTTCTCTTATGGGCACCTAGCCGCCTTCACCTTCTTCCTCTACCCCTTA 

GCAGGAATAGGGTGTCCTCCCTTCTTTCAAAGCACTTTGCTTGCATTTTATTTTATTTTTTTA 

AGAGTCCTTCATAGAGCTCAGTCAGGAAGGGGATGGGGCACCAAGCCAAGCCCCCAGCATTGG 

GAGCGGCCAGGCCACAGCTGCTGCTCCCGTAGTCCTCAGGCTGTAAGCAAGAGACAGCACTGG 

CCCTTGGCCAGCGTCCTACCCTGCCCAACTCCAAGGACTGGGTATGGATCGCTGGGCCCTAGG 

CTCTTGCTTCTGGGGCTATTGGAGGGTCAGTGTCTGTGACTGAATAAAGTTCCATTTTGTGGA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

FIGURE 166 

MELSDVTLIEGVGNEVMVVAGVVVLILALVLAWLSTYVADSGSNQLLGAIVSAGDTSVLHLGH
VDHLVAGQGNPEPTELPHPSEGNDEKAEEAGEGRGDSTGEAGAGGGVEPSLEHLLDIQGLPKR 
QAGAGSSSPEAPLRSEDSTCLPPSPGLITVRLKFLNDTEELAVARPEDTVGALKSKYFPGQES 
QMKLIYQGRLLQDPARTLRSLNITDNCVIHCHRSPPGSAVPGPSASLAPSATERPSLGVNVGS
LMVPVFVVLLGVVWYFRINYRQFFTAPATVSLVGVTVFFSFLVFGMYGR

Signal peptide: 

amino acids 1-36 

Transmembrane domains: 

amino acids 246-267, 275-301 

N-glycosylation sites. 

amino acids 162-166, 211-215 

N-myristoylation sites. 

amino acids 48-54, 105-111, 109-115, 129-135, 177-183, 247-253 



Cell attachment sequence. 

amino acids 97-100 

FIGURE 167 

GGCGGCTGTGTGTCGCCGGAGCCGAAGCGCGCAGGCCCGTCCCGGTGGCCGGGGAGCGGGCGGGTGGGGGCGCC^ 

Sggttcatgtacctgctgagctggctgtcgctcttcatccaggtggccttcatcacgctggctgtcgcggctg 

ISSaSacSggcagaac^ 

?ctcca?cgctgtactgattggcctctacgtctttgagcgc^ 

JcaaStcgtctactttggcctcctccagaccttccccttcatcatgctg 

g5Sac?ag?ggtggtgaatcattacctagcatttcagttttttgcagaagaa^ 

5ggcctat?tcactttctgcctgtggataattccgtttgcgttttttgtgtcac 

tcccSctaccatgcagccaggagatgatgtcgtc^^ 

ggScSggttgtcttctccttcatcaaagaggccattctacccagtcgtcagaagatatac^^ 
ggcaggatgtggLggcaagat^^ 

ArrCACCTACCACCTGCCCTGGCTTTCTTCCCCTCAACTCTGGAGCCCCATCCCCACCCTCCTTGGGGGGCTCAG 

AGGAGGCCGAGGTCCCTCAGTCCTCCCCTGTCCCTTCCAAGGTGGGTCAGGAGGTTCTGGCCCCGCTGGGGCAGG 

^AGGGCAGGGTCTGTGAAGCTTAAGAGCAGATGGTGAC^ 

GCTTGGCATGTCCAACAGAAATAG 

S?Sggactggactcacttttctgctccagggactccttttct 
g?tcttg?ggcccttaatgggaagggggcaggggtgaggagct(^gcctgctcaagga 

ArrCAGCCTCTCTGATGCACTCTCTTCCATCTCTTTCCCCAAGGCTCCGTGACTGTCAAACTGGGAGTAGGAGAG 

A^rAG^GTTTCTGGCTGAATACTATCTTAGGCTCAAGGAGAAAC 
TTTTAAAGAAATGGGACTAATGGGAG 

SScCTGGGGAGA^TATCTGCCTTCTAAGAAGGCAGGCCAGGT 

ccSgcactccaactatagtgccttgcaagtgctcaacagtacatattggaatgaagtcccta 

SggccatgSctatacctcaaagtgaggctg^ 
Sggaaaaacccctgtgcttgaaaatatttcctctatatca 

?ggc??^atttaccttctcttaagaataccagctttcccctttccctgagaggaagagcacatgttggtc^ 

ctcSag?ctgaacgagattgccaggcccttttctcctatgcacaccaggatagacaagg^^ 

gcctgcatcatcctcccattgggctgacagctggccctactttcctccct 

GATGTGGCTTCGCCCCCTCGACTCTACT 

tc^aSaStctcggcIgcatttgacatggctgatcaccccttgcttcttg 

S?g?tagttgtcctacctccctggctgttcctttcaggcttccgtgcaggcttctccacttgcccatgcacagt 
aggg?S??caggS?S 

^TAGAGTTGACCTCCAGCCCCAACATCCTTCCTG 

ACAATTCAGGTCCATGGCCCAGATGGTACTTGCTGTCTTCTGCAAACCTGCCCCTTCTGGGTACTTCCCT 
^CGtcATCACTCAGGAGCCAGACAGGAAACTTATT 

SSggtcaggtctacctt^ 

cg^agEctcctactgcttctctatcatgtggccagagctatcttcctaaaatgcattgcat 

SrScTCTGGCCTAAAACCTTCCTTGGCTCCCTGCTGCCCT 

ct?ctgtatccagagatgcagtggc 

ArTTTCTCTGTCAAATCTCAACTTAGACTTGACTTCCTCCAAGGAGCTTTGGCTATACTCTCTCCTCCCGACCCC 

JaccctggSac^cacagItcact 
ScSgaggSSStgtgw 

TCAATAAATGTTTGTTTAATGAA 

FIGURE 168 

MWFMYLLSWLSLFIQVAFITLAVAAGLYYLAELIEEYTVATSRIIKYMIWFSTAVLIGLYVFE 
RFPTSMIGVGLFTNLVYFGLLQTFPFIMLTSPNFILSCGLVVVNHYLAFQFFAEEYYPFSEVL
AYFTFCLWIIPFAFFVSLSAGENVLPSTMQPGDDVVSNYFTKGKRGKRLGILVVFSFIKEAIL

PSRQKIY 

Signal peptide: 

amino acids 1-25 

Transmembrane domain:

amino acids 126-146 

Casein kinase II phosphorylation site. 

amino acids 145-148 

N-myristoylation sites. 

amino acids 73-78, 82-87 

Amidation sites. 

amino acids 168-171, 171-174 



Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 91-101 

FIGURE 169

CAAAGCCCTACCCTCACCATTCACCAGGTCCTGTGGGAAGAGCAGCGTGGAGGTGGGCTGAGG 

TTAGAAGGTGCAGAGCGTGGAAGAAGATTGTGAGCTGAGTATTGGACATCTGTTCTTGAATAG 

TGCCTGGGCCTGCCATAGGAAAGGAAGTTCTCCAGGGTTACAGTTCTTATCCGCGTGAATACA 

CATGGCTCTGTTACGAAAAATTAATCAGGTGCTGCTGTTCCTTCTGATCGTGACCCTCTGTGT 

GATTCTGTATAAGAAAGTTCATAAGGGGACTGTGCCCAAGAATGACGCAGATGATGAATCCGA 

GACTCCTGAAGAACTGGAAGAAGAGATTCCTGTGGTGATTTGTGCTGCAGCAGGGAGGATGGG 

TGCCACTATGGCTGCCATCAATAGCATCTACAGCAACACTGACGCCAACATCTTGTTCTATGT 

AGTGGGACTCCGGAATACTCTGACTCGAATACGAAAATGGATTGAACATTCCAAACTGAGAGA 

AATAAACTTTAAAATCGTGGAATTCAACCCGATGGTCCTCAAAGGGAAGATCAGACCAGACTC 

ATCGAGGCCTGAATTGCTCCAGCCTCTGAACTTTGTTCGATTTTATCTCCCTCTACTTATCCA 

CCAACACGAGAAAGTCATCTATTTGGACGATGATGTAATTGTACAAGGTGATATCCAAGAACT 

GTATGACACCACCTTGGCCCTGGGCCACGCGGCGGCTTTCTCAGATGACTGCGATTTGCCCTC 

TGCTCAGGACATAAACAGACTCGTGGGACTTCAGAACACATATATGGGCTATCTGGACTACCG 

GAAGAAGGCCATCAAGGACCTTGGCATCAGCCCCAGCACCTGCTCTTTCAATCCTGGTGTGAT 

TGTTGCCAACATGACAGAATGGAAGCACCAGCGCATCACCAAGCAATTGGAGAAATGGATGCA 

AAAGAATGTGGAGGAAAACCTCTATAGCAGCTCCCTGGGAGGAGGGGTGGCCACCTCCCCAAT 

GCTGATTGTGTTTCATGGGAAATATTCCACAATTAACCCCCTGTGGCACATAAGGCACCTGGG 

CTGGAATCCAGATGCCAGATATTCGGAGCATTTTCTGCAGGAAGCTAAATTACTCCACTGGAA 

TGGAAGACATAAACCTTGGGACTTCCCTAGTGTTCACAACGACTTATGGGAAAGCTGGTTTGT 

TCCTGACCCTGCAGGGATATTTAAACTCAATCACCATAGCTGATATAACTCTACCCTTAAAAT 

ATTCCCTGTATAGAAATGTGGAATTGTCCCTTTGTAGCCAACTATAACATTGTTCTTTATGAA 

TATTACCTTTGATACATATGATCCACAATATAAAAACCAAAAACTACTGTGTGCAAATTATAC 

CTTGGACCATATAGGCATTGATTAACTTCTTTAAGTACATGTGATAACTATGGAAATCAAGAT 

TATGTGACTGAAAAACATAAAGGAAGAGACCCATCTAGATAACAGCAATCAACCTGCTTAATT 

CTGAATGACAATTATATCCACAAATTTTTAAAACTTCTACATGTATTTTTCACATGAAGATCT 

CCTTAACAGGTTGCCAACCTTTTCTTTTATAAAACTATTACATTTAAAATATGGACGTCTGAA 

AAATAAAATATTCATCATTTTTAAAA 
FIGURE 170

MALLRKINQVLLFLLIVTLCVILYKKVHKGTVPKNDADDESETPEELEEEIPVVICAAAGRMG 
ATMAAINSIYSNTDANILFYVVGLRNTLTRIRKWIEHSKLREINFKIVEFNPMVLKGKIRPDS
SRPELLQPLNFVRFYLPLLIHQHEKVIYLDDDVIVQGDIQELYDTTLALGHAAAFSDDCDLPS 
AQDINRLVGLQNTYMGYLDYRKKAIKDLGISPSTCSFNPGVIVANMTEWKHQRITKQLEKWMQ 
KNVEENLYSSSLGGGVATSPMLIVFHGKYSTINPLWHIRHLGWNPDARYSEHFLQEAKLLHWN 

GRHKPWDFPSVHNDLWESWFVPDPAGIFKLNHHS 

Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 234-238 

Tyrosine kinase phosphorylation site. 

amino acids 253-261 

N-myristoylation sites. 

amino acids 63-69, 86-92, 198-204, 218-224, 229-235, 265-271, 
266-272 

FIGURE 171 

GCCAGAGGCTGCAGCTGGAGCCCAGAGCCCAAGATGGAGCCCCAGCTGGGGCCTGAGGCTGCC 
GCCCTCCGCCCTGGCTGGCTGGCCCTGCTGCTGTGGGTCTCAGCCCTGAGCTGTTCTTTCTCC 
TTGCCAGCTTCTTCCCTTTCTTCTCTGGTGCCCCAAGTCAGAACCAGCTACAATTTTGGAAGG 
ACTTTCCTCGGTCTTGATAAATGCAATGCCTGCATCGGGACATCTATTTGCAAGAAGTTCTTT 
AAAGAAGAT^ATAAGATCTGACAACTGGCTGGCTTCCCACCTTGGACTGCCTCCCGATTCCTTG 
CTTTCTTATCCTGCAAATTACTCAGATGATTCCAAAATCTGGCGCCCTGTGGAGATCTTTAGA 
CTGGTCAGCAAATATCAAAACGAGATCTCAGACAGGAGAATCTGTGCCTCTGCATCAGCCCCA 
AAGACCTGCAGCATTGAGCGTGTCCTGCGGAAAACAGAGAGGTTCCAGAAATGGCTGCAGGCC 
AAGCGCCTCACGCCGGACCTGGTGCAGGACTGTCACCAGGGCCAGAGAGAACTAAAGTTCCTG 
TGTATGCTGAGATAACACCAGTGAAAAAGCCTGGCATGGAGCCCAGCACTGAGAACTTCCAGA 
AAGTGTTAGCCTTCTCCCAACTGTGTTATACCAACCACATTTTCAAATAGTAATCATTAAAGA 
GGCTTCTGCATC7\AAAAAAAAAAAAAAAAAAAAA7U\AAAAAAAAAAAAAAAAAAAAAA 
FIGURE 172

MEPQLGPEAAALRPGWLALLLWVSALSCSFSLPASSLSSLVPQVRTSYNFGRTFLGLDKCNAC 
IGTSICKKFFKEEIRSDNWLASHLGLPPDSLLSYPANYSDDSKIWRPVEIFRLVSKYQNEISD 
RRICASASAPKTCSIERVLRKTERFQKWLQAKRLTPDLVQDCHQGQRELKFLCMLR 

Signal peptide: 

amino acids 1-28 

N-glycosylation site.

amino acids 100-103 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 158-161 

N-myristoylation sites. 

amino acids 56-61, 65-70 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 18-28 

Prenyl group binding site (CAAX box).

amino acids 179-182 

Leucine zipper pattern. 

amino acids 5-26 

FIGURE 173

GCTGGACTGCTCGCTGGCCGGCAGCGCACCGTTTTGAAGGTCCTAGCCCACCTGGGCTGGCTC 

ACGCGCACGACTAGCCGCTCCCATACAGCACGCCCGGACTCTGTCGTCGCTTAAGGCCACTCC 

TATTCTACGGCTGACCCCTGGTGGTCACGTGGATCTGTTCGCCACGCAAGTCTGGGTCCTTCG 

GCGATTGACCGGGGTCCTTGCTGTTCGGGAGCCTCTCCTAAGCTGCCTGTTCGCGCGAGAGTT 

TGGAGGGGCGGGTTTGGGGTCGGTGTCTGATTGGGGCTCGCACCGCAGCACGCTGGAGTCCCG 

CTTAGGTACCAGTTAGCGTCAGGGGAGCTGGGTCAGGCGGTCGCCGGGACACCCCGTGTGTGG 

CAGGCGGCGAAGCGCTCTGGAGAATCCCGGACAGCCCTGCTCCCTGCAGCCAGGTGTAGTTTC 

GGGAGCCACTGGGGCCAAAGTGAGAGTCCAGCGGTCTTCCAGCGCTTGGGCCACGGCGGCGGC 

CCTGGGAGCAGAGGTGGAGCGACCCCATTACGCTAAAGATGAAAGGCTGGGGTTGGCTGGCCC

TGCTTCTGGGGGCCCTGCTGGGAACCGCCTGGGCTCGGAGGAGCCAGGATCTCCACTGTGGAG 

CATGCAGGGCTCTGGTGGATGAACTAGAATGGGAAATTGCCCAGGTGGACCCCAAGAAGACCA 

TTCAGATGGGATCTTTCCGGATCAATCCAGATGGCAGCCAGTCAGTGGTGGAGGTGCCTTATG 

CCCGCTCAGAGGCCCACCTCACAGAGCTGCTGGAGGAGATATGTGACCGGATGAAGGAGTATG 

GGGAACAGATTGATCCTTCCACCCATCGCAAGAACTACGTACGTGTAGTGGGCCGGAATGGAG 

AATCCAGTGAACTGGACCTACAAGGCATCCGAATCGACTCAGATATTAGCGGCACCCTCAAGT 

TTGCGTGTGAGAGCATTGTGGAGGAATACGAGGATGAACTCATTGAATTCTTTTCCCGAGAGG 

CTGACAATGTTAAAGACAAACTTTGCAGTAAGCGAACAGATCTTTGTGACCATGCCCTGCACA 

TATCGCATGATGAGCTAT^ACCACTGGAGCAGCCCACACTGGCTTGATGGATCACCCCCAGG 

AGGGGAAAATGGTGGCAATGCCTTTTATATATTATGTTTTTACTGAAATTAACTGAAAAAATA 



TGAAACCAAAAGT 




FIGURE 174 

MKGWGWLALLLGALLGTAWARRSQDLHCGACRALVDELEWEIAQVDPKKTIQMGSFRINPDGS 
QSVVEVPYARSEAHLTELLEEICDRMKEYGEQIDPSTHRKNYVRVVGRNGESSELDLQGIRID
SDISGTLKFACESIVEEYEDELIEFFSREADNVKDKLCSKRTDLCDHALHISHDEL 

Signal peptide: 

amino acids 1-20 

N-myristoylation sites. 

amino acids 12-18, 16-22, 29-35 

Endoplasmic reticulum targeting sequence. 

amino acids 179-184 
FIGURE 175 

CGCAGCGCGGCAGTCCTGATGGCCCGGCATGGGTTACCGCTGCTGCCCCTGCTGTCGCTCCTG 

GTCGGCGCGTGGCTCAAGCTAGGAAATGGACAGGCTACTAGCATGGTCCAACTGCAGGGTGGG 

AGATTCCTGATGGGAACAAATTCTCCAGACAGCAGAGATGGTGAAGGGCCTGTGCGGGAGGCG 

ACAGTGAAACCCTTTGCCATCGACATATTTCCTGTCACCAACAAAGATTTCAGGGATTTTGTC 

AGGGAGAAAAAGTATCGGACAGAAGCTGAGATGTTTGGATGGAGCTTTGTCTTTGAGGACTTT 

GTCTCTGATGAGCTGAGAAACAAAGCCACCCAGCCAATGAAGTCTGTACTCTGGTGGCTTCCA 

GTGGAAAAGGCATTTTGGAGGCAGCCTGCAGGTCCTGGCTCTGGCATCCGAGAGAGACTGGAG 

CACCCAGTGTTACACGTGAGCTGGAATGACGCCCGTGCCTACTGTGCTTGGCGGGGAAAACGA 

CTGCCCACGGAGGAAGAGTGGGAGTTTGCCGCCCGAGGGGGCTTGAAGGGTCAAGTTTACCCA 

TGGGGGAACTGGTTCCAGCCAAACCGCACCAACCTGTGGCAGGGAAAGTTCCCCAAGGGAGAC 

AAAGCTGAGGATGGCTTCCATGGAGTCTCCCCAGTGAATGCTTTCCCCGCCCAGAACAACTAC 

GGGCTCTATGACCTCCTGGGGAACGTGTGGGAGTGGACAGCATCACCGTACCAGGCTGCTGAG 

CAGGACATGCGCGTCCTCCGGGGGGCATCCTGGATCGACACAGCTGATGGCTCTGCCAATCAC 

CGGGCCCGGGTCACCACCAGGATGGGCAACACTCCAGATTCAGCCTCAGACAACCTCGGTTTC 

CGCTGTGCTGCAGACGCAGGCCGGCCGCCAGGGGAGCTGTAAGCAGCCGGGTGGTGACAAGGA 

GAAAAGCCTTCTAGGGTCACTGTCATTCCCTGGCCATGTTGCAAACAGCGCAATTCCAAGCTC 

GAGAGCTTCAGCCTCAGGAAAGAACTTCCCCTTCCCTGTCTCCCATCCCTCTGTGGCAGGCGC 

CTCTCACCAGGGCAGGAGAGGACTCAGCCTCCTGTGTTTTGGAGAAGGGGCCCAATGTGTGTT 

GACGATGGCTGGGGGCCAGGTGTTTCTGTTAGAGGCCAAGTATTATTGACACAGGATTGCAAA 

CACACAAACAGTTGGAACAGAGCACTCTGAAAGGCCATTTTTTAAGCATTTTAAAATCTATTC 

TCTCCCCCTTTCTCCCTGGATGATTCAGGAAGCTGACATTGTTTCCTCAAGGCAGAATTTTCC 

TGGTTCTGTTTTCTCAGCCAGTTGCTGTGGAAGGAGAATGCTTTCTTTGTGGCCTCATCTGTG 

GTTTCGTGTCCCTCTGAAGGAAACTAGTTTCCACTGTGTAACAGGCAGACATGTAACTATTTA 

AAGCACAGTTCAGTCCTAAAAGGGTCTGGGAGAACCAGATGATGTACTAGGTGAAGCATTGCA 

TTGTGGGAATCACAAAGCAAATAGTACTCCAGAAAGACAAATATCAGAAGCTTCCTATTCTTT 

TTTTTTTTTTTTTTTTTTTTTTGAGACAGGGTCTTTCTCTGTTGCCCAGGCTAGAGTGCACTG 

GTGATCACGGCTCACTCTAGCCTTGAATTCCTGGGCCCAAGCAATTCTCCCACCTCAGCCTCC 

TGAGTAGCTGGGACTACAAGTGTGCACCACCATGCCTGGCTAATTTTTTGAATTTTTGTAGTG 

ATGGGATCTCGCTCTGTTGCCCAGGGTGGTCTCGAACTCCTGGCCTCAAGCGATCCTCCCACC 

TCGACCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCTCGCCTGGGCCCCCTTCTCCATA 

TGCCTCCAAAAACATGTCCCTGGAGAGTAGCCTGCTCCCACACTGTCACTGGATGTCATGGGG 

CCAATAAAATCTCCTGCAATTGTGTATCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAA 

FIGURE 176 

MARHGLPLLPLLSLLVGAWLKLGNGQATSMVQLQGGRFLMGTNSPDSRDGEGPVREATVKPFA 
IDIFPVTNKDFRDFVREKKYRTEAEMFGWSFVFEDFVSDELRNKATQPMKSVLWWLPVEKAFW 
RQPAGPGSGIRERLEHPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQ 
PNRTNLWQGKFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTASPYQAAEQDMRVL 
RGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPGEL 

Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 191-195 

N-myristoylation sites. 

amino acids 23-29, 25-31, 175-181

Amidation site. 

amino acids 159-163 

FIGURE 177 

GCCTTCTCGCGCCTGACCATGCACCCCTGCATCTTCCTGCTGGGCCACAGGCGAGCGCTTTAT 

TTCTGGAGCTGAGGGCTAAAACTTTTTTGACTTTTCTTCTCCTCAACATCTGAATCATGCCAT

GTGCCCAGAGGAGCTGGCTTGCAAACCTTTCCGTGGTGGCTCAGCTCCTTAACTTTGGGGCGC 

TTTGCTATGGGAGACAGCCTCAGCCAGGCCCGGTTCGCTTCCCGGACAGGAGGCAAGAGCATT 

TTATCAAGGGCCTGCCAGAATACCACGTGGTGGGTCCAGTCCGAGTAGATGCCAGTGGGCATT 

TTTTGTCATATGGCTTGCACTATCCCATCACGAGCAGCAGGAGGAAGAGAGATTTGGATGGCT 

CAGAGGACTGGGTGTACTACAGAATTTCTCACGAGGAGAAGGACCTGTTTTTTAACTTGACGG 

TCAATCAAGGATTTCTTTCCAATAGCTACATCATGGAGAAGAGATATGGGAACCTCTCCCATG 

TTAAGATGATGGCTTCCTCTGCCCCCCTCTGCCATCTCAGTGGCACGGTTCTACAGCAGGGCA 

CCAGAGTTGGGACGGCAGCCCTCAGTGCCTGCCATGGACTGACTGGATTTTTCCAACTACCAC 

ATGGAGACTTTTTCATTGAACCCGTGAAGAAGCATCCACTGGTTGAGGGAGGGTACCACCCGC 

ACATCGTTTACAGGAGGCAGAAAGTTCCAGAAACCAAGGAGCCAACCTGTGGATTAAAGGGTA 

TTGTGACTCACATGTCCTCCTGGGTTGAAGAATCTGTTTTGTTCTTTTGGTAGTTTTATTAAA

ACATGACCTATTCTTACTCAAGTCTCTTATCTCCTCTGTATTCTTTTTTTTTTAATATCTTCA 

TGACATTCAAATCTCTTCTGTATTCTCTTGCCAGAAAGTGTACATTCTTTTTGCTTGTATAAA 

CCCTTTCACTTGTC 

FIGURE 178 

MPCAQRSWLANLSVVAQLLNFGALCYGRQPQPGPVRFPDRRQEHFIKGLPEYHVVGPVRVDAS
GHFLSYGLHYPITSSRRKRDLDGSEDWVYYRISHEEKDLFFNLTVNQGFLSNSYIMEKRYGNL 
SHVKMMASSAPLCHLSGTVLQQGTRVGTAALSACHGLTGFFQLPHGDFFIEPVKKHPLVEGGY 
HPHIVYRRQKVPETKEPTCGLKGIVTHMSSWVEESVLFFW 

Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 11-15, 105-109, 125-129 

N-myristoylation site. 

amino acids 149-155 

FIGURE 179 

CAGATTTAAAAAGAAAACCTTTACTGAATCAGCTGAGTGTTAATAATACGAATTTCCTTTTCT 

TGCCAATTCTGATCTGAACAGAAAATCCAAGAACAGGGATATGTGTGGATTACAGTTTTCTCT 

GCCTTGCCTACGACTGTTTCTGGTTGTTACCTGTTATCTTTTATTATTACTCCACAAAGAAAT 

AGTTGGATGTTCGTCTGTTTGTCAGCTCTGCACTGGGAGACAAATTAACTGCCGTAACTTAGG 

CCTTTCGAGTATTCCTAAGAATTTTCCTGAAAGTACAGTTTTTCTGTATCTGACTGGGAATAA 

TATATCTTATATAAATGAAAGTGAATTAACAGGACTTCATTCTCTTGTAGCATTGTATTTGGA 

TAATTCTAACATTCTGTATGTATATCCAAAAGCCTTTGTTCAATTGAGGCATCTATATTTTCT 

ATTTCTAAATAATAATTTCATCAAACGCTTAGATCCTGGAATATTTAAGGGACTTTTAAATCT 

TCGTAATTTATATTTACAGTATAATCAGGTATCTTTTGTTCCGAGAGGAGTATTTAATGATCT 

AGTTTCAGTTCAGTACTTAAATCTACAAAGGAATCGCCTCACTGTCCTTGGGAGTGGTACCTT 

TGTTGGTATGGTTGCTCTTCGGATACTTGATTTATCAAACAATAACATTTTGAGGATATCAGA 

ATCAGGCTTTCAACATCTTGAAAACCTTGCTTGTTTGTATTTAGGAAGTAATAATTTAACAAA 

AGTACCATCAAATGCCTTTGAAGTACTTAAAAGTCTTAGAAGACTTTCTTTGTCTCATAATCC 

TATTGAAGCAATACAGCCCTTTGCATTTAAAGGACTTGCCAATCTGGAATACCTCCTCCTGAA 

AAATTCAAGAATTAGGAATGTTACTAGGGATGGGTTTAGTGGAATTAATAATCTTAAACATTT 

GATCTTAAGTCATAATGATTTAGAGAATTTAAATTCTGACACATTCAGTTTGTTAAAGAATTT 

AATTTACCTTAAGTTAGATAGAAACAGAATAATTAGCATTGATAATGATACATTTGAAAATAT 

GGGAGCATCTTTGAAGATCCTTAATCTGTCATTTAATAATCTTACAGCCTTGCATCCAAGGGT 

CCTTAAGCCGTTGTCTTCATTGATTCATCTTCAGGCAAATTCTAATCCTTGGGAATGTAACTG 

CAAACTTTTGGGCCTTCGAGACTGGCTAGCATCTTCAGCCATTACTCTAAACATCTATTGTCA 

GAATCCCCCATCCATGCGTGGCAGAGCATTACGTTATATTAACATTACAAATTGTGTTACATC 

TTCAATAAATGTATCCAGAGCTTGGGCTGTTGTAAAATCTCCTCATATTCATCACAAGACTAC 

TGCGCTAATGATGGCCTGGCATAAAGTAACCACAAATGGCAGTCCTCTGGAAAATACTGAGAC 

TGAGAACATTACTTTCTGGGAACGAATTGCTACTTCACCTGCTGGTAGATTTTTTCAAGAGAA 

TGCCTTTGGTAATCCATTAGAGACTACAGCAGTGTTACCTGTGCAAATACAACTTACTACTTC 

TGTTACCTTGAACTTGGAAAAAAACAGTGCTCTACCGAATGATGCTGCTTCAATGTCAGGGAA 

AACATCTCTAATTTGTACACAAGAAGTTGAGAAGTTGAATGAGGCTTTTGACATTTTGCTAGC 

TTTTTTCATCTTAGCTTGTGTTTTAATCATTTTTTTGATCTACAAAGTTGTTCAGTTTAAACA 

AAAACTAAAGGCATCAGAAAACTCAAGGGAAAATAGACTTGAATACTACAGCTTTTATCAGTC 

AGCAAGGTATAATGTAACTGGCTCAATTTGTAACACTTCCCCAAATTCTCTAGAAAGTCCTGG 

CTTGGAGCAGATTCGACTTCATAAACAAATTGTTCCTGAAAATGAGGCACAGGTCATTCTTTT 

TGAACATTCTGCTTTATAACTCAACTAAATATTGTCTATAAGAAACTTCAGTGCCATGGACAT 

GATTTAAACTGAAACCTCCTTATATAATTATATACTTTAGTTGGAAATATAATGAATTATATG 

AGGTTAGCATTATTAAAATATGTTTTTTNTTAAAAAAAAAAAAAAAAAAAAAAAAA

FIGURE 180

MCGLQFSLPCLRLFLVVTCYLLLLLHKEILGCSSVCQLCTGRQINCRNLGLSSIPKNFPESTV 
FLYLTGNNISYINESELTGLHSLVALYLDNSNILYVYPKAFVQLRHLYFLFLNNNFIKRLDPG 
IFKGLLNLRNLYLQYNQVSFVPRGVFNDLVSVQYLNLQRNRLTVLGSGTFVGMVALRILDLSN 
NNILRISESGFQHLENLACLYLGSNNLTKVPSNAFEVLKSLRRLSLSHNPIEAIQPFAFKGLA 
NLEYLLLKNSRIRNVTRDGFSGINNLKHLILSHNDLENLNSDTFSLLKNLIYLKLDRNRIISI 
DNDTFENMGASLKILNLSFNNLTALHPRVLKPLSSLIHLQANSNPWECNCKLLGLRDWLASSA 
ITLNIYCQNPPSMRGRALRYINITNCVTSSINVSRAWAVVKSPHIHHKTTALMMAWHKVTTNG 
SPLENTETENITFWERIPTSPAGRFFQENAFGNPLETTAVLPVQIQLTTSVTLNLEKNSALPN 
DAASMSGKTSLICTQEVEKLNEAFDILLAFFILACVLIIFLIYKVVQFKQKLKASENSRENRL
EYYSFYQSARYNVTASICNTSPNSLESPGLEQIRLHKQIVPENEAQVILFEHSAL 

Signal peptide: 

amino acids 1-41 

Transmembrane domain:

amino acids 530-547 

N-glycosylation sites. 

amino acids 71-75, 76-80, 215-219, 266-270, 317-321, 331-335, 
336-340, 400-404, 410-414, 451-455, 579-583 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 231-235 

N-myristoylation sites.

amino acids 3-9, 69-75, 126-132, 174-180 



ATP/GTP-binding site motif A (P-loop).

amino acids 506-514 

FIGURE 181 

GGCCTGGCGCGGCGCTCCGGTAAGGCGTGTGTGCGGCAGGGCGGGGACAGAACCGTCCTCTCG 

GGCTCTGGGCGTGTCCGAGACCGCGCTCCCCGCCGAAATCAAGCTCCGAGTCATCCGTGTGGG 

GCATTCGTCCCCCCTGGCACAGTTGGCCTCTTTCCAGAAGCCCGTTTTGTTTGTTTTACGTCT 

AAATTCGCGTCGGTTCTTATTTCTCTCCCTGGCAAGGTCTGAAGACGGGTAGGAGAATAACCT 

GTGTCAGCGTGTTATGATGCCGTCCCGTACCAACCTGGCTACTGGAATCCCCAGTAGTAAAGT 

GAAATATTCAAGGCTCTCCAGCACAGACGATGGCTACATTGACCTTCAGTTTAAGAAAACCCC 

TCCTAAGATCCCTTATAAGGCCATCGCACTTGCCACTGTGCTGTTTTTGATTGGCGCCTTTCT 

CATTATTATAGGCTCCCTCCTGCTGTCAGGCTACATCAGCAAAGGGGGGGCAGACCGGGCCGT 

TCCAGTGCTGATCATTGGCATTCTGGTGTTCCTACCCGGATTTTACCACCTGCGCATCGCTTA 

CTATGCATCCAAAGGCTACCGTGGTTACTCCTATGATGACATTCCAGACTTTGATGACTAGCA

CCCACCCCATAGCTGAGGAGGAGTCACAGTGGAACTGTCCCAGCTTTAAGATATCTAGCAGAA 

ACTATAGCTGAGGACTAAGGAATTCTGCAGCTTGCAGATGTTTAAGAAAATAATGGCCAGATT 

TTTTGGGTCCTTCCCAAAGATGTTAAGTGAACCTACAGTTAGCTAATTAGGACAAGCTCTATT 

TTTCATCCCTGGGCCCTGACAAGTTTTTCCACAGGAATATGTATCATGGAAGAATAGAGGTTA 

TTCTGTAATGGAAAAGTGTTGCCTGCCACCACCCTCTGTAGAGCTGAGCATTTCTTTTAAATA 

GTCTTCATTGCCAATTTGTTCTTGTAGCAAATGGAACAATGTGGTATGGCTAATTTCTTATTA 

TTAAGTAGTTTATTTTAAAAATATCTGAGTATATTATCCTGTACACTTATCCCTACCTTCATG 

TTCCAGTGGAAGACCTTAGTAAAATCAAAGATCAGTGAGTTCATCTGTAATATTTTTTTTACT 

TGCTTTCTTACTGACAGCAACCAGGAATTTTTTTATCCTGCAGAGCAAGTTTTCAAAATGTAA

ATACTTCCTCTGTTTAACAGTCCTTGGACCATTCTGATCCAGTTCACCAGTAGGTTGGACAGC 

ATATAATTTGCATCATTTTGTCCCTTGTAAATCAAGATGTTCTGCAGATTATTCCTTTAACGG 

CCGGACTTTTGGCTGTTTCCTAATGAAACATGTAGTGGTTATTATTTAGAGTTTATAGCCGTA 

TTGCTAGCACCTTGTAGTATGTCATCATTCTGCTCATGATTCCAAGGATCAGCCTGGATGCCT 

AGAGGACTAGATCACCTTAGTTTGATTCTATTTTTTAGCTTGCAAAAAGTGACTTATATTCCA 

AAGAAATTAAAATGTTGAAATCCAAATCCTAGAAATAAAATGAGTTTNNTTCCAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 182 

MMPSRTNLATGIPSSKVKYSRLSSTDDGYIDLQFKKTPPKIPYKAIALATVLFLIGAFLIIIG 
SLLLSGYISKGGADRAVPVLIIGILVFLPGFYHLRIAYYASKGYRGYSYDDIPDFDD

Transmembrane domains:

amino acids 45-66, 79-95 

N-myristoylation sites. 

amino acids 11-17, 75-81 

FIGURE 183 

CTAAAAAATACAAAAATTAGCTGGGCGTGGTGTCATGTACCTGTAATCCCAGCTACTCAAGAGGCTGAGGCAGGA 

GAATCGCTTGAACCCAGGAGGCAGAGGTTGCAGTGAGCCAAGATTAAGTCACTGCACTCCAGCCTGGGTGACAGA 

GCAAGACTCTGTATCAAAATAAATAAATAAAGTACAACTCTGGATGGGCATGGTGGCTTATGTCTGTAATCCCAG 

CACTTTGGGAACTTGAGGCGGGTAGATTGCTTGAGTCCGGGAGTTTGAGACCAGTCTGGGTAATATGGTAACCCT 

GTCTACCAAAAATACAGGTATTAGCCAGTCTCATAACTCGGTCTCAAAATAAATAAATACATACATACATAGATG

AAAATTTAAAAAATAAAGTCCAACTCAGCGGTTTTCAGCATATTTACAGAGTTGTACAATCTTCACCACTATCTA 

ATTTCAGAACATTTTCATCACCCCCAAAAGAAACCTAACCCATTGACTATCTCTCCATTTCCTCCCTCTCCCTAG 

CCTCTGGCAACCACTAATCTCTTTTTTGTCTCTATAGATTTGCCTATTTTGGACAGTTCATATACAAGGAATCAT 

ACCACATGTAGCCTTTTGTGTCCGGCTTCTTTGATTAATAGAATGTTTTCAAGGCTCATCTATGCTGTAGCCTGT 

ATCAGCACTTCATTCCTTTCTATGGCTGAATAATAGTCCACTGTAGGGATGTGCCATGTTTTTCCACTAGCTGAT 

GGACATTTGGGTTGTTTCCACCTTCTGGCTATTATAAATATTGCTGCTATAAATATTCACTTACAAGTTTTTGTG 

TGGACATATGTTTTTATTTCTTCTGGTATATCCTTCGGAGTGGAACTGCTGGATCAGGTGGTAACTCTAGGTCTA 

ACCTGGCAGTTAAACAGAATCCTATGCATGCTGTAGTCCATGAGTTGAAATAAACACTTGACCCATAGTAAGTGC 

CAGATCATCTTCATTTCACAGCAACCAGTAATTTCACAGATGAGGAAATGAAGGCTCCCAGAGGTGAACTGGCTT 

TTCCCATTTGAGCAGTTCCAAGTCAGACAGTTAAAAAGTGGCAGGACCTGGAAGAGAAGCTAGTTCTTTCACCCT 

GGCATTCAGGGCTGCCTCCTGGGCTACGGGGCTGGCATTTAGAATAGAGCTAAGGTCTGCTGCCAAGGCAGGTGC 

CCCAGTCTGCCTCCTCTGTGTCCTTATTCCACTTTCTCTGCAGCCCTCCAGGGGACCCCTCTCTCAGCCACCCTC 

TCTCTGGTGATGTCACAGTGCTGCCGGAAGATCAAAGATACGGTGCAGAAACTGGCTTCGGACCATAAGGACATT 

CACAGCAGTGTATCCCGAGTGGGCAAAGCCATTGACAGGAACTTCGACTCTGAGATCTGTGGTGTTGTGTCAGAT 

GCGGTGTGGGACGCGCGGGAACAGCAGCAGCAGATCCTGCAGATGGCCATCGTGGAACACCTGTATCAGCAGGGC 

ATGCTCAGCGTGGCCGAGGAGCTGTGCCAGGAATCAACGCTGAATGTGGACTTGGATTTCAAGCAGCCTTTCCTA 

GAGTTGAATCGAATCCTGGAAGCCCTGCACGAACAAGACCTGGGTCCTGCGTTGGAATGGGCCGTCTCCCACAGG 

CAGCGCCTGCTGGAACTCAACAGCTCCCTGGAGTTCAAGCTGCACCGACTGCACTTCATCCGCCTCTTGGCAGGA 

GGCCCCGCGAAGCAGCTGGAGGCCCTCAGCTATGCTCGGCACTTCCAGCCCTTTGCTCGGCTGCACCAGCGGGAG 

ATCCAGGTGATGATGGGCAGCCTGGTGTACCTGCGGCTGGGCTTGGAGAAGTCACCCTACTGCCACCTGCTGGAC 

AGCAGCCACTGGGCAGAGATCTGTGAGACCTTTACCCGGGACGCCTGTTCCCTGCTGGGGCTTTCTGTGGAGTCC 

CCCCTTAGCGTCAGCTTTGCCTCTGGCTGTGTGGCGCTGCCTGTGTTGATGAACATCAAGGCTGTGATTGAGCAG 

CGGCAGTGCACTGGGGTCTGGAATCACAAGGACGAGTTACCGATTGAGATTGAACTAGGCATGAAGTGCTGGTAC 

GCTCATCTGTGGCCATGTTATCTCCCGAGATGCACTCAATAAGCTCATTAATGGAGGAAACACTCCGTGTTCGCT 

TGCCCCATCCTCCGCCAGCAGACGTCAGATTCCAACCCTCCCATCAAGCTGAAGTGTCCCTACTGTCCCATGGAG 

CAGAACCCGGCAGATGGGAAACGCATCATATTCTGATTCCTACCTGGAAGGAATTTTGTTGAAAGGGGTTTTCAC 

CTGTGAGCCTTGGTCTGTCTCGGTAGGGTGGTCAACTTCAGTGGACTGTGGTTGGTTTCAGAGCGCCTGGCTGAG 

GAGTTCCACTGAGGGGAGCACTGGAGCAGCCCTTTGGCAGAGGCTGAGGAGGGAGATGGACCAGCCCACGCCTGG

CACCTGGCTCCATGGCATAAGGAAAGGGAGATGCTGGCCTCTGTGCTCCTGCTGTCTTTTCCTGTTTCTGTTTGC 

GTTTGACTTAGTAGCAACCGACAGAGTGGCAAGGGATTTGGTCTTCAGCAGTAGACATCCTTCCACCCCTGCCCT 

CAGCCAAGTCTCTTGCTGCCATGCCAATGCTATGTCCACCCTTGCCCCTCGGCCCAAGAGTGTCCAGCGGTGGCC 

CACCTCTTCCTCCCACTACAGCCTCAACAGTATGTACCATCTCCCACTGTAAATAGTCCCAGTTAGAACGGAATG 

CCGTTGTTTTATAACTTTGAACAAATGTATTTACTGCCCTTCTCAAAA 
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FIGURE 184 

QCCRKIKDTVQKLASDHKDIHSSVSRVGKAIDRNFDSEICGVVSDAVWDAREQQQQILQMAIV 
EHLYQQGMLSVAEELCQESTLNVDLDFKQPFLELNRILEALHEQDLGPALEWAVSHRQRLLEL 
NSSLEFKLHRLHFIRLLAGGPAKQLEALSYARHFQPFARLHQREIQVMMGSLVYLRLGLEKSP 
YCHLLDSSHWAEICETFTRDACSLLGLSVESPLSVSFASGCVALPVLMNIKAVIEQRQCTGVW 
NHKDELPIEIELGMKCWYHSVFACPILRQQTSDSNPPIKLICGHVISRDALNKLINGGKLKCP 

YCPMEQNPADGKRIIF

Transmembrane domain: 

amino acids 222-241 

N-glycosylation site. 

amino acids 129-133 

Tyrosine kinase phosphorylation site. 

amino acids 151-159, 184-193 

Amidation site.

amino acids 327-331 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 222-233 
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FIGURE 185 

GAGCGACGCTGTCTCTAGTCGCTGATCCCAAATGCACCGGCTCATCTTTGTCTACACTCTAAT 
CTGCGCAAACTTTTGCAGCTGTCGGGACACTTCTGCAACCCCGCAGAGCGCATCCATCAAAGC 
TTTGCGCAACGCCAACCTCAGGCGAGATGACTTGTACCGAAGAGATGAGACCATCCAGGTGAA 
AGGAAACGGCTACGTGCAGAGTCCTAGATTCCCGAACAGCTACCCCAGGAACCTGCTCCTGAC 
ATGGCGGCTTCACTCTCAGGAGAATACACGGATACAGCTAGTGTTTGACAATCAGTTTGGATT 
AGAGGAAGCAGAAAATGATATCTGTAGGTATGATTTTGTGGAAGTTGAAGATATATCCGAAAC 
CAGTACCATTATTAGAGGACGATGGTGTGGACACAAGGAAGTTCCTCCAAGGATAAAATCAAG 
AACGAACCAAATTAAAATCACATTCAAGTCCGATGACTACTTTGTGGCTAAACCTGGATTCAA 
GATTTATTATTCTTTGCTGGAAGATTTCCAACCCGCAGCAGCTTCAGAGACCAACTGGGAATC 
TGTCACAAGCTCTATTTCAGGGGTATCCTATAACTCTCCATCAGTAACGGATCCCACTCTGAT 
TGCGGATGCTCTGGACAAAAAAATTGCAGAATTTGATACAGTGGAAGATCTGCTCAAGTACTT 
CAATCCAGAGTCATGGCAAGAAGATCTTGAGAATATGTATCTGGACACCCCTCGGTATCGAGG 
CAGGTCATACCATGACCGGAAGTCAAAAGTTGACCTGGATAGGCTCAATGATGATGCCAAGCG 
TTACAGTTGCACTCCCAGGAATTACTCGGTCAATATAAGAGAAGAGCTGAAGTTGGCCAATGT 
GGTCTTCTTTCCACGTTGCCTCCTCGTGCAGCGCTGTGGAGGAAATTGTGGCTGTGGAACTGT 
CAACTGGAGGTCCTGCACATGCAATTCAGGGAAAACCGTGAAAAAGTATCATGAGGTATTACA 
GTTTGAGCCTGGCCACATCAAGAGGAGGGGTAGAGCTAAGACCATGGCTCTAGTTGACATCCA 
GTTGGATCACCATGAACGATGCGATTGTATCTGCAGCTCAAGACCACCTCGATAAGAGAATGT 
GCACATCCTTACATTAAGCCTGAGAGAA 
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FIGURE 186 

MHRLIFVYTLICANFCSCRDTSATPQSASIKALRNANLRRDDLYRRDETIQVKGNGYVQSPRF 
PNSYPRNLLLTWRLHSQENTRIQLVFDNQFGLEEAENDICRYDFVEVEDISETSTIIRGRWCG 
HKEVPPRIKSRTNQIKITFKSDDYFVAKPGFKIYYSLLEDFQPAAASETNWESVTSSISGVSY 
NSPSVTDPTLIADALDKKIAEFDTVEDLLKYFNPESWQEDLENMYLDTPRYRGRSYHDRKSKV 
DLDRLNDDAKRYSCTPRNYSVNIREELKLANVVFFPRCLLVQRCGGNCGCGTVNWRSCTCNSG
KTVKKYHEVLQFEPGHIKRRGRAKTMALVDIQLDHHERCDCICSSRPPR 

Signal peptide: 

amino acids 1-18 

N-glycosylation site. 

amino acids 270-274 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 262-266 

Tyrosine kinase phosphorylation site. 

amino acids 256-265 

N-myristoylation sites. 

amino acids 94-100, 186-192, 297-303, 298-304 

TonB-dependent receptor proteins signature 1. 

amino acids 1-56 
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FIGURE 187

CATGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTG 

TGTCTCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGT 

CCTACAATGGACTCGACCAGAGGGTCTTCAAGGAGTTAAAGTTACTTACACTGTGCAGTATTT 

CATATATGGGCAAAAGAAATGGCTGAATAAATCAGAATGCAGAAATATCAATAGAACCTACTG 

TGATCTTTCTGCTGAAACTTCTGACTACGAACACCAGTATTATGCCAAAGTTAAGGCCATTTG 

GGGAACAAAGTGTTCCAAATGGGCTGAAAGTGGACGGTTCTATCCTTTTTTAGAAACACAAAT 

TGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 

AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCT 

GAAGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAA 

CCACACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTT 

CGTCCCAGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGA 

TCAATCATCAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGT 

GTTTCTTTTTTCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACA 

CCCAGCAAATTTGATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGA 

AAAAATCGTGATTAACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGA 

TATGAGTTTACTGGGAAAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAA 

CCTGAGGCCCCCTCAGGAGGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGA

AATTTTTTGTGACTCTGAAGAAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAG 

CAGAACAATACCCCCGGATAAAACAGTCATTGAATATGAATATGATGTCAGAACCACTGACAT 

TTGTGCGGGGCCTGAAGAGCAGGAGCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACATT 

ATTGGAGTCGCAGGCAGCGTTGGCAGTCTTGGGCCCGCAAACGTTACAGTACTCATACACCCC 

TCAGCTCCAAGACTTAGACCCCCTGGCGCAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGA 

AGAGCCATCGACGACCCTGGTCGACTGGGATCCCCAAACTGGCAGGCTGTGTATTCCTTCGCT 

GTCCAGCTTCGACCAGGATTCAGAGGGCTGCGAGCCTTCTGAGGGGGATGGGCTCGGAGAGGA 

GGGTCTTCTATCTAGACTCTATGAGGAGCCGGCTCCAGACAGGCCACCAGGAGAAAATGAAAC 

CTATCTCATGCAATTCATGGAGGAATGGGGGTTATATGTGCAGATGGAAAACI^TGCCAACA 

CTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAGTCACCCCTTTGATCCCAGCCATAAA 

GTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGTCTGTGAGAA 
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FIGURE 188

MPLPPLLLLLLAAPWGRAVPCVSGGLPKPANITFLSINMKNVLQWTPPEGLQGVKVTYTVQYF 
IYGQKKWLNKSECRNINRTYCDLSAETSDYEHQYYAKVKAIWGTKCSKWAESGRFYPFLETQI 
GPPEVALTTDEKSISVVLTAPEKWKRNPEDLPVSMQQIYSNLKYNVSVLNTKSNRTWSQCVTN
HTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLKDQSSEFKAKIIFWYVLPISITV 
FLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEKIVINFITLNISDDSKISHQD 
MSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFCDSEENTEGTSLTQQESLS 
RTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQAALAVLGPQTLQYSYTP 
QLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDSEGCEPSEGDGLGEE 

GLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Signal sequence: 

amino acids 1-18 

Transmembrane domain:

amino acids 240-260

N-glycosylation sites.

amino acids 31-34, 72-75, 80-83, 171-174, 180-183, 189-192,
304-307, 523-526

Tyrosine kinase phosphorylation site. 

amino acids 385-392, 518-526 

N-myristoylation sites. 

amino acids 53-58, 106-111, 368-373, 492-497 



Tissue factor 

amino acids 1-278 
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FIGURE 189 

rrTOTGTTCC^GAA^GCATCATGTAATAAAGGTTGAGCCAGTGATACAGAGAGGCCATGAG 
arTrTGGTGCA^CACATCCTGCTCTATCAGTGCAGCAACAACTTTAACGAC^ 

SpSggg™^ 

Ic?cca??agSc?g?a??^ 

ggStSgmaa?tctggactgaggt^ 

rGGGTGATTGAGGCTGGCCTCTGGGTGAGCCTCTTCCATACCATCCCTCCAGGGA 

??cc!gtc?gagggt^^^ 

Ig5ggaaSca?gtgtttgctgttcttctccatgctc 

rrTCA^TTCG^AAAAGGGAAGGAAATGAAATTACTTGCCTATGATGATGATTTTGACTTC 

Sc^agIa^?^^ 

rArTrTCGCTACAACACGAAAGATAGAGCTGAGATGACTTGGGGAGGACTAAGCACCAGGAGT 
rRAATGTGTCTCTC^ATAC^CTTC^TT 

rr^ACATTATGGAACAACTTCAGTTCATTGGGGTTAAGGAGATCTACAGACCA 

aTTTrAAGTACAGGTTAAAGACTGTGTCCACTTTGGGCATGAAGAGTGTGGAGACTTTTCTTC 

rrrAT^TCCCTCCCTCCTTTTTCCTTTCCATGTTACATGAGAGACA^ 

CTCTTTCTTAGAA^TACCTGATGTTATAT 

A^GATAACCATT^AAAAAATTGGGCTGTCATGTGGGAATAAAAGAATTCTTTCTTTCCTAAA 
AAAAAAAA 
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FIGURE 190 

MCCWPLLLLWGLLPGTAAGGSGRTYPHRTLLDSEGKYWLGWSQRGSQIAFRLQVRTAGYVGFG 
FSPTGAMASADIWGGVAHGRPYLQDYFTNANRELKKDAQQDYHLEYAMENSTHTIIEFTREL 
HTCDINDKSITDSTVRVIWAYHHEDAGEAGPKYHDSNRGTKSLRLLNPEKTSVLSTALPYFDL 
VNQDVPIPNKDTTYWCQMFKIPVFQEKHHVIKVEPVIQRGHESLVHHILLYQCSNNFNDSVLE 
SGHECYHPNMPDAFLTCETVIFAWAIGGEGFSYPPHVGLSLGTPLDPHYVLLEVHYDNPTYEE 
GLIDNSGLRLFYTMDIRKYDAGVIEAGLWVSLFHTIPPGMPEFQSEGHCTLECLEEALEAEKP 
SGIHVFAVLLHAHLAGRGIRLRHFRKGKEMKLLAYDDDFDFNFQEFQYLKEEQTILPGDNLIT 
ECRYNTKDRAEMTWGGLSTRSEMCLSYLLYYPRINLTRCASIPDIMEQLQFIGVKEIYRPVTT 
WPFIIKSPKQYKNLSFMDAMNKFKWTKKEGLSFNKLVLSLPVNVRCSKTDNAEWSIQGMTALP 

PDIERPYKAEPLVCGTSSSSSLHRDFSINLLVCLLLLSCTLSTKSL 

Signal peptide: 

amino acids 1-18 

Transmembrane domains:

amino acids 56-73, 378-393, 583-602 

N-glycosylation sites. 

amino acids 114-118, 247-251, 476-480, 517-521

N-myristoylation sites.

amino acids 11-17, 15-21, 20-26, 45-51, 68-74, 79-85, 290-296, 
316-322, 337-343, 342-348, 456-462, 534-540, 582-588 

Copper type II, ascorbate-dependent monooxygenases proteins. 

amino acids 271-321, 422-474 
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FIGURE 191 

GCTTCAGCTGAAGAAAGAGAGGAATGAAGCGCCTTCTGCTTCTGTTTTTGTTCTTTATAACAT 
TTTCTTCTGCATTTCCCTTAGTCCGGATGACGGAAAATGAAGAAAATATGCAACTGGCTCAGG 
CATATCTCAACCAGTTCTACTCTCTTGAAATAGAAGGGAATCATCTTGTTCAAAGCAAGAATA 
GGAGTCTCATAGATGACAAAATTCGGGAAATGCAAGCATTTTTTGGATTGACAGTGACTGGAA 
AACTGGACTCAAACACCCTTGAGATCATGAAGACACCCAGGTGTGGGGTGCCTGATGTGGGCC 
AGTATGGCTACACCCTCCCTGGGTGGAGAAAATACAACCTCACCTACAGAATAATAAACTATA 
CTCCGGATATGGCACGAGCTGCTGTGGATGAGGCTATCCAAGAAGGTTTAGAAGTGTGGAGCA 
AAGTCACTCCACTAAAATTCACCAAGATTTCAAAGGGGATTGCAGACATCATGATTGCCTTTA 
GGACTCGAGTCCATGGTCGGTGTCCTCGCTATTTTGATGGTCCCTTGGGAGTGCTTGGCCATG 
CCTTTCCTCCTGGTCCGGGTCTGGGTGGTGACACTCATTTTGATGAGGATGAAAACTGGACCA 
AGGATGGAGCAGGATTCAACTTGTTTCTTGTGGCTGCTCATGAATTTGGTCATGCACTGGGGC 
TCTCTCACTCCAATGATCAAACAGCCTTGATGTTCCCAAATTATGTCTCCCTGGATCCCAGAA 
AATACCCACTTTCTCAGGATGATATCAATGGAATCCAGTCCATCTATGGAGGTCTGCCTAAGG 
TACCTGCTAAGCCAAAGGAACCCACTATACCCCATGCCTGTGACCCTGACTTGACTTTTGACG 
CTATCACAACTTTCCGCAGAGAAGTAATGTTCTTTAAAGGCAGGCACCTATGGAGGATCTATT 
ATGATATCACGGATGTTGAGTTTGAATTAATTGCTTCATTCTGGCCATCTCTGCCAGCTGATC 
TGCAAGCTGCATACGAGAACCCCAGAGATAAGATTCTGGTTTTTAAAGATGAAAACTTCTGGA 
TGATCAGAGGATATGCTGTCTTGCCAGATTATCCCAAATCCATCCATACATTAGGTTTTCCAG 
GACGTGTGAAGAAAATAGATGCAGCCGTCTGTGATAAGACCACAAGAAAAACCTACTTCTTTG 
TGGGCATTTGGTGCTGGAGGTTTGATGAAATGACCCAAACCATGGACAAAGGATTCCCGCAGA 
GAGTGGTAAAACACTTTCCTGGAATCAGTATCCGTGTTGATGCTGCTTTCCAGTACAAAGGAT 
TCTTCTTTTTCAGCCGTGGATCAAAGCAATTTGAATACAACATTAAGACAAAGAATATTACCC 
GAATCATGAGAACTAATACTTGGTTTCAATGCAAAGAACCAAAGAACTCCTCATTTGGTTTTG 
ATATCAACAAGGAAAAAGCACATTCAGGAGGCATAAAGATATTGTATCATAAGAGTTTAAGCT 
TGTTTATTTTTGGTATTGTTCATTTGCTGAAAAACACTTCTATTTATCAATAAATTCATAGAC 
CTAAAATAAACCTCAACAGGTCTTTTAATATAAATTCTGCTTCAAAATAGAATAAAACCATTC 

TTTAACAAC 
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FIGURE 192 

MKRLLLLFLFFITFSSAFPLVRMTENEENMQLAQAYLNQFYSLEIEGNHLVQSKNRSLIDDKI 
REMQAFFGLTVTGKLDSNTLEIMKTPRCGVPDVGQYGYTLPGWRKYNLTYRIINYTPDMARAA 
VDEAIQEGLEVWSKVTPLKFTKISKGIADIMIAFRTRVHGRCPRYFDGPLGVLGHAFPPGPGL 
GGDTHFDEDENWTKDGAGFNLFLVAAHEFGHALGLSHSNDQTALMFPNYVSLDPRKYPLSQDD 
INGIQSIYGGLPKVPAKPKEPTIPHACDPDLTFDAITTFRREVMFFKGRHLWRIYYDITDVEF 
ELIASFWPSLPADLQAAYENPRDKILVFKDENFWMIRGYAVLPDYPKSIHTLGFPGRVKKIDA 
AVCDKTTRKTYFFVGIWCWRFDEMTQTMDKGFPQRVVKHFPGISIRVDAAFQYKGFFFFSRGS
KQFEYNIKTKNITRIMRTNTWFQCKEPKNSSFGFDINKEKAHSGGIKILYHKSLSLFIFGIVH 

LLKNTSIYQ 

Signal peptide: 

amino acids 1-17 

N-glycosylation sites.

amino acids 55-59, 110-114, 200-204, 452-456, 470-474, 508-512 

N-myristoylation site. 

amino acids 71-77, 205-211, 223-229 

Hemopexin domain signature. 

amino acids 171-202, 207-238, 318-334 

Neutral zinc metallopeptidases, zinc-binding region signature.

amino acids 213-223 



Matrixins cysteine switch. 

amino acids 89-97, 207-238 
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FIGURE 193 

CACAATCAGGTCCCATTCTATAGATGGGGAAACTGAGGCTTGAGGTCACATAGGCGTCGTTCA 
AGGCTGGTATACCTGCACCCTCTCCCATGTGAACAACATGGTTCTGGGTAATGGGGGCTGTCA 
TCCAGTCTCCTCCCTGCCCCTGCTGGTGCACTTCCTGCCTCTGCTGGTGCACTTTCTGCCCCT 
ACTGGTATATTTGCTGCCTCTGCTGGGGCGCTTCCTGCCTCGGCTGGTGTATCTCCTGCCCCT 
GCTGGTGCACTTTCTGCCCCCGCTGATGCACTTCCTGCCTCTGCTGGTGCACTTCCTGGCTCT 
GCTGGCACACTTCCTGCCTCTGCTGGTGCACTTCCTGGCTCTGCTGGCGCACTTTCCTGCCCC 
TGCTGGTGTATTTCCTGCCCCTGCTGGTGTACTTCCTTCCCCTGCTGGTGCACTTCCTGCCTC 
TGCTGGCGCACTTCTTGCCTCTCCAGGCCCTACCTAGCCTCTCCCTCTTATATATGGAAGTCT 
TCCCAGTTCACTGACACTGGTAACAGGGACTCTGCTCTTGGTGTTGCTGTCTGCCCTGGGGAT 
GGGCATCTGTGTCTTCCTTTACTACTGCTGGCTCAGGACCCAGAGCTTTGAAGCATGTCCAGA 
TGCAGGTCCGGGCACCAGAGTCTAAGGAGCCCCTACACCCACCAGGATTTTCCAATAAAGAGA 
TGTTCACCA 
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FIGURE 194 

MVLGNGGCHPVSSLPLLVHFLPLLVHFLPLLVYLLPLLGRFLPRLVYLLPLLVHFLPPLMHFL 
PLLVHFLALLAHFLPLLVHFLALLAHFPAPAGVFPAPAGVLPSPAGALPASAGALLASPGPT 

Signal peptide: 

amino acids 1-39 

N-myristoylation sites. 

amino acids 4-10, 109-115, 116-122 

Leucine zipper pattern. 

amino acids 14-36, 16-38, 17-39, 21-43, 24-46, 28-50, 31-53, 

35-57, 38-60, 42-64, 45-67, 49-71, 52-74, 56-78, 59-81, 63-85, 
65-87, 66-88 
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FIGURE 195

GGCAAGGCGGCGGCGGCGGCGGCGGCAGCCGCGGTGGCGGCGTGGGGAACATCTCGGCAGCCA 

CCGCGCTTCTCCCGCTGGAGCGGGCGTCCAGCTTGGCTGCCCTCGGTCCTTCCCTGCCACGTT 

TCGGGTCGCCCTGCACCCCCCACCCAGGCTCGCTTCTCTTCGAAGCGGGAAGGGCGCCTTGCA 

GGATCCTGCCGCCCCTCCAACCGGATGCTGGGTCTAGAGCTCCCCAGAGCGAGGCGCT^ 

GGACTCCTGCCCCGCCAACCCTGACCGCCGGGGGGTGCCCCCGGGACGTAGCGCCGCGGAGAG 

GAAGCGGCAAAGGGGACCATGCGGCGCCTGACTCGTCGGCTGGTTCTGCCAGTCTTCGGGGTG

CTCTGGATCACGGTGCTGCTGTTCTTCTGGGTAACCAAGAGGAAGTTGGAGGTGCCGACGGGA 

CCTGAAGTGCAGACCCCTAAGCCTTCGGACGCTGACTGGGACGACCTGTGGGACCAGTTTGAT 

GAGCGGCGGTATCTGAATGCCAAAAAGTGGCGCGTTGGTGACGACCCCTATAAGCTGTATGCT 

TTCAACCAGCGGGAGAGTGAGCGGATCTCCAGCAATCGGGCCATCCCGGACACTCGCCATCTG 

Iga^cacactgctggtgtattgcacggaccttccacccactagcatcatcatcac 

A^CGAGGCCCGCTCCACGCTGCTCAGGACCATCCGCAG 

CTGATCCGGGAAATCATATTAGTGGATGACTTCAGCAATGACCCTGATGACTGTAAACAGCTC 

a??a^g?tgcSggtgaaatgcttgcgcaataatgaacggcaaggtctggtc 

a^c^gLcgctgacatcgcccagggcaccactctgactttcctcgacagccactgtga^ 

aacagggactggctccagcctctgttgcacagggtcaaagaggactacacgcgggtggtgtgc 

cctgtgatcgatatcattaacctggacaccttcacctacatcgagtctgcctcggagctcaga 

ggggggtttgactggagcctccacttccagtgggagcagctctccccagagcagaaggctcgg 

cgcctggaccccacggagcccatcaggactcctatcatagctggagggctcttcgtgatc^ 

aaagcttggtttgattacctggggaaatatgatatggacatggacatctggggtggggagaac 

tttgaaatctccttccgagtgtggatgtgcgggggcagcctagagatcgtcccctgcagccga 

gtcgg^acgtcttccggaagaagcacccctacgttttccctgatggaaatgccaaca^ 

ataaagaacaccaagcggacagctgaagtgtggatggatgaatacaagcaatactattac^ 

g™gccattcgccctggagaggcccttcgggaatgttgagagcagattggacctgaggaag 

.^ctgcgctgccagagcttcaagtggtacctggagaatatctaccctgaactcagca^ 

aaggagtcctccatccagaagggcaatatccgacagagacagaagtgcctggaatctcaaagg 

cagaacaaccaagaaaccccaaacctaaagttgagcccctgtgccaaggtcaaaggcgaagat 

gS^ccagg^gggccttcacatacacccagcagatcctccaggaggagctgtgcctg 

?ca1?catcaccttgttccctggcgccccagtggttcttgtcctttgcaag^ 

?GAcS?GGACCAAAACTG 

ACAGATA^TTCGGTGATGGCACCGAGAACGGCAAGGAAATCGTCGTCAACCCATG^ 

TCACTCATGAGCCAGCACTGGGACATGGTGAGCTCTTGAGGACCCCTGCCAGAAGCAGCAAGG 

GCCATGGGGTGGTGCTTCCCTGGACCAGAACAGACTGGAAACTGGGCAGCAAGCAGCCTGCAA 

ccacctcagacatcctggactgggaggtggaggcagagccccccaggacaggagcaactgtct 

cagggIggacagaggaa^ 
aggcSSgttcca^ctggccagtc 

tgggaagtgtttattgttccttttcctacaaaggaagcagtctctggaggccagaaagaaaag 
ccttctttttcactaggccaggactacattgagagatgaagaatggaggttgtttccaaaaga 
aataaagagaaacttagaagttgtctctggaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 

AAAAAAAAAAAAAAAAAA 
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FIGURE 196

MRRLTRRLVLPVFGVLWITVLLFFWVTKRKLEVPTGPEVQTPKPSDADWDDLWDQFDERRYLN 

AKKWRVGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARST
LLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRGADI 
AQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIESASELRGGFDWS
LHFQWEQLSPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFR 
VWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARPFAL 
ERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQKGNIRQRQKCLESQRQNNQET 
PNLKLSPCAKVKGEDAKSQVWAFTYTQQILQEELCLSVITLFPGAPVVLVLCKNGDDRQQWTK
TGSHIEHIASHLCLDTDMFGDGTENGKEIVVNPCESSLMSQHWDMVSS

Transmembrane domain:

amino acids 475-493 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 2-6 

Tyrosine kinase phosphorylation sites. 

amino acids 68-75, 401-409 

N-myristoylation sites.

amino acids 178-184, 186-192, 192-198, 346-352, 383-389, 526-532 
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FIGURE 197

rrAGCTCACCCTTCGCAGCCGCGATGGGGGAAGACGACGCCGCGCTTCGGGCTGGCAGCAGGGGGCTCTCCGACC 

SSgcagScagtgggaS 
SSgSSgtccct^^ 

rrrGGGCGAGTGCCACGCCAGGCGCC 

, TrarrTrrTGGG CTTCTTCCGCAGCTCCTATGTGCTCCACGGGGAGAGAAGATTCCTTGGTGTTACiLA^llii 
S^ACACMGCCAGAAAGGCATTTCCTTGTTTTGATG 

agcaS^^ 

rrr aTrACTTTOCACAGACCCCTCTCATGTCCACATATTATTTAGCCTGGGCA^ 

aaacScaS^^^ 
aSct^ataacS^ 

ggaa?aS?at?ggaggcttt 

rATRCCTGTCTGAGGAG^GGATTTTCTTCCTTGGCATGC 
TrrarrrCATGGAAAAC^C^CATTTTCAATGAATAT^ 

SggccSa^^ 

TAATGCTGGCC^CAGTTTTMCAACAAGCACTGTCACCAACAGGCATCAACACTTATT 

T^TTTTTACT^TTATTTT^GGTTTTGGGGGATATTTTTTATT^ 
AGGCATATAGATAGTCTATGCATATGTATM 

acaSSca^g^gt^ 

•prnnrrnATTTATrTrAACATTATTCTTTCTATGTCCTAACTAAATTTCTCAACTGTTATGAATTTTTCAICi 
AAAAAAAAAA 
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FIGURE 198 

MGEDDAALRAGSRGLSDPWADSVGVRPRTTERHIAVHKRLVLAFAVSLVALLAVTMLAVLLSL 

RFDECGASATPGADGGPSGFPERGGNGSLPGSARRNHHAGGDSWQPEAGGVASPGTTSAQPPS

EEEREPWEPWTQLRLSGHLKPLHYNLMLTAFMENFTFSGEVNVEIACRNATRYVVLHASRVAV 

EKVQLAEDRAFGAVPVAGFFLYPQTQVLVWLNRTLDAQRNYNLKIIYNALIENELLGFFRSS 

YVLHGERRFLGVTQFSPTHARKAFPCFDEPIYKATFKISIKHQATYLSLSNMPVETSVFEEDG 

WVTDHFSQTPLMSTYYLAWAICNFTYRETTTKSGWVRLYARPDAIRRGSGDYALHITKRLIE 

FYEDYFKVPYSLPKLDLLAVPKHPYAAMENWGLSIFVEQRILLDPSVSSISYLLDVTMVIVHE 

ICHQWFGDLVTPVWWEDVWLKEGFAHYFEFVGTDYLYPGWNMEKQRFLTDVLHEVMLLDGLAS 

SHPVSQEVLQATDIDRVFDWIAYKKGAALIRMLANFMGHSVFQRGLQDYLTIHKYGNAARNDL 

WNTLSEALKRNGKYVNIQEVMDQWTLQMGYPVITILGNTTAENRIIITQQHFIYDISAKTKAL 

KLQNNSYLWQIPLTIWGNRSHVSSEAIIWVSNKSEHHRITYLDKGSWLLGNINQTGYFRVNY 

DLRNWRLLIDQLIRNHEVLSVSNRAGLIDDAFSLARAGYLPQNIPLEIIRYLSEEKDFLPWHA 

ASRALYPLDKLLDRMENYNIFNEYILKQVATTYIKLGWPKNNFNGSLVQASYQHEELRREVIM 

LACSFGNKHCHQQASTLISDWISSNRNRIPLNVRDIVYCTGVSLLDEDVWEFIWMKFHSTTAV 

SEKKILLEALTCSDDRNLLNRLLNLSLNSEWLDQDAIDVIIHVARNPHGRDLAWKFFRDKWK 

ILNTRYGEALFMYSKLISGVTEFLNTEGELKELKNFMKNYDGVAAASFSRAVETVEANVRWKM 

LYQDELFQWLGKALRH 

Transmembrane domain:

amino acids 44-63

N-glycosylation sites.

amino acids 89-93, 160-164, 175-179, 222-226, 338-342, 605-609, 

634-638, 649-653, 663-667, 684-688, 800-804, 906-910 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 362-366 

Tyrosine kinase phosphorylation site. 

amino acids 520-528

N-myristoylation sites.

amino acids 78-84, 87-93, 90-96, 118-124, 501-507, 604-610,
825-831, 987-993

Neutral zinc metallopeptidases, zinc-binding region signature.

amino acids 437-447 
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FIGURE 199 

GCGCCCGGCGCAGCTCGGCCAGAGCGACCGCGGGGCTGAGCGCGCGTCCGCCCAGGGGGCTCCGGAAGCTGCCCC 
GGCCCGCGGCCTCCTCCCTCGCTCCCGCTTCCCCTTTCTCGCTCACCGCCGCCCTCCTTCCCCAGCTCCCTCGCC 
GTCCGCCCGCCCCACAGCCAGCGGCTCCGCGCCCCCTGCAGCCACGATGCCCGCGGCCCGGCCGCCCGCCGCGGG 
ACTCCGCGGGATCTCGCTGTTCCTCGCTCTGCTCCTGGGGAGCCCGGCGGCAGCGCTGGAGCGAGATGCTCTTCC 
CGAGGGAGATGCTAGCCCTTTGGGTCCTTACCTCCTGCCCTCAGGAGCCCCGGAGAGAGGCAGTCCTGGCAAAGA 
GCACCCTGAAGAGAGAGTGGTAACAGCGCCCCCCAGTTCCTCACAGTCGGCGGAAGTGCTGGGCGAGCTGGTGCT 
GGATGGGACCGCACCCTCTGCACATCACGACATCCCAGCCCTGTCACCGCTGCTTCCAGAGGAGGCCCGCCCCAA 
GCACGCCTTGCCCCCCAAGAAGAAACTGCCTTCGCTCAAGCAGGTGAACTCTGCCAGGAAGCAGCTGAGGCCCAA 
GGCCACCTCCGCAGCCACTGTCCAAAGGGCAGGGTCCCAGCCAGCGTCCCAGGGCCTAGATCTCCTCTCCTCCTC 
CACGGAGAAGCCTGGCCCACCGGGGGACCCGGACCCCATCGTGGCCTCCGAGGAGGCATCAGAAGTGCCCCTTTG 
GCTGGATCGAAAGGAGAGTGCGGTCCCTACAACACCCGCACCCCTGCAAATCTCCCCCTTCACTTCGCAGCCCTA 
TGTGGCCCACACACTCCCCCAGAGGCCAGAACCCGGGGAGCCTGGGCCTGACATGGCCCAGGAGGCCCCCCAGGA 
GGACACCAGCCCCATGGCCCTGATGGACAAAGGTGAGAATGAGCTGACTGGGTCAGCCTCAGAGGAGAGCCAGGA 
GACCACTACCTCCACCATTATCACCACCACGGTCATCACCACCGAGCAAGCACCAGCTCTCTGCAGTGTGAGCTT 
CTCCAATCCTGAGGGGTACATTGACTCCAGCGACTACCCACTGCTGCCCCTCAACAACTTTCTGGAGTGCACATA 
CAACGTGACAGTCTACACTGGCTATGGGGTGGAGCTCCAGGTGAAGAGTGTGAACCTGTCCGATGGGGAACTGCT 
CTCCATCCGCGGGGTGGACGGCCCTACCCTGACCGTCCTGGCCAACCAGACACTCCTGGTGGAGGGGCAGGTAAT 
CCGAAGCCCCACCAACACCATCTCCGTCTACTTCCGGACCTTCCAGGACGACGGCCTTGGGACCTTCCAGCTTCA 
CTACCAGGCCTTCATGCTGAGCTGCAACTTTCCCCGCCGGCCTGACTCTGGGGATGTCACGGTGATGGACCTGCA 
CTCAGGTGGGGTGGCCCACTTTCACTGCCACCTGGGCTATGAGCTCCAGGGCGCTAAGATGCTGACATGCATCAA 
TGCCTCCAAGCCGCACTGGAGCAGCCAGGAGCCCATCTGCTCAGCTCCTTGTGGAGGGGCAGTGCACAATGCCAC 
CATCGGCCGCGTCCTCTCCCCAAGTTACCCTGAAAACACAAATGGGAGCCAATTCTGCATCTGGACGATTGAAGC 
TCCAGAGGGCCAGAAGCTGCACCTGCACTTTGAGAGGCTGTTGCTGCATGACAAGGACAGGATGACGGTTCACAG 
CGGGCAGACCAACAAGTCAGCTCTTCTCTACGACTCCCTTCAAACCGAGAGTGTCCCTTTTGAGGGCCTGCTGAG 
CGAAGGCAACACCATCCGCATCGAGTTCACGTCCGACCAGGCCCGGGCGGCCTCCACCTTCAACATCCGATTTGA 
AGCGTTTGAGAAAGGCCACTGCTATGAGCCCTACATCCAGAATGGGAACTTCACTACATCCGACCCGACCTATAA 
CATTGGGACTATAGTGGAGTTCACCTGCGACCCCGGCCACTCCCTGGAGCAGGGCCCGGCCATCATCGAATGCAT 
CAATGTGCGGGACCCATACTGGAATGACACAGAGCCCCTGTGCAGAGCCATGTGTGGTGGGGAGCTCTCTGCTGT 
GGCTGGGGTGGTATTGTCCCCAAACTGGCCCGAGCCGTACGTGGAAGGTGAAGATTGTATCTGGAAGATCCACGT 
GGGAGAAGAGAAACGGATCTTCTTAGATATCCAGTTCCTGAATCTGAGCAACAGTGACATCTTGACCATCTACGA 
TGGCGACGAGGTCATGCCCCACATCTTGGGGCAGTACCTTGGGAACAGTGGCCCCCAGAAACTGTACTCCTCCAC 
GCCAGACTTAACCATCCAGTTCCATTCGGACCCTGCTGGCCTCATCTTTGGAAAGGGCCAGGGATTTATCATGAA 
CTACATAGAGGTATCAAGGAATGACTCCTGCTCGGATTTACCCGAGATCCAGAATGGCTGGAAAACCACTTCTCA 
CACGGAGTTGGTGCGGGGAGCCAGAATCACCTACCAGTGTGACCCCGGCTATGACATCGTGGGGAGTGACACCCT 
CACCTGCCAGTGGGACCTCAGCTGGAGCAGCGACCCCCCATTTTGTGAGAAAATTATGTACTGCACCGACCCCGG 
AGAGGTGGATCACTCGACCCGCTTAATTTCGGATCCTGTGCTGCTGGTGGGGACCACCATCCAATACACGTGCAA 
CCCCGGTTTTGTGCTTGAAGGGAGTTCTCTTCTGACCTGCTACAGCCGTGAAACAGGGACTCCCATCTGGACGTC 
TCGCCTGCCCCACTGCGTTTCGGAGGAGTCCCTGGCATGTGACAACCCAGGGCTGCCTGAAAATGGATACCAAAT 
CCTGTACAAGCGACTCTACCTGCCAGGAGAGTCCCTCACCTTCATGTGCTACGAAGGCTTTGAGCTCATGGGTGA 
AGTGACCATCCGCTGCATCCTGGGACAGCCATCCCACTGGAACGGGCCCCTGCCCGTGTGTAAAGTTAATCAAGA 
CAGTTTTGAACATGCTTTAGAAGCAGAAGCGGCAGCAGAGACGTCGCTGGAAGGGGGGAACATGGCCCTGGCTAT 
CTTCATCCCGGTCCTCATCATCTCCTTACTGCTGGGAGGAGCCTACATTTACATCACAAGATGTCGCTACTATTC 
CAACCTCCGCCTGCCTCTGATGTACTCCCACCCCTACAGCCAGATCACCGTGGAAACCGAGTTTGACAACCCCAT 
TTACGAGACAGGGGAAACCAGAGAGTATGAGGTTTCTATCTAAAGAGAGCTACACTTGAGAAGGGGACTTGTGAA 
CTCAACCACAATCTCCTCGAGACATTCATCCAGAGACCATGTGGCACTTGATTGAAACCCCAGAATGTCGACTGT 
CTTTTGTTTAGACTCTTTATCAAAGGTTTACTGTTTTCTTCCCTGTATTTATTATATTTAAAAGTGAAAAAAAAA 

AAAAAAAAAAA 
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FIGURE 200 

MPAARPPAAGLRGISLFLALLLGSPAAALERDALPEGDASPLGPYLLPSGAPERGSPGKEHPE
ERVVTAPPSSSQSAEVLGELVLDGTAPSAHHDIPALSPLLPEEARPKHALPPKKKLPSLKQVN
SARKQLRPKATSAATVQRAGSQPASQGLDLLSSSTEKPGPPGDPDPIVASEEASEVPLWLDRK 
ESAVPTTPAPLQISPFTSQPYVAHTLPQRPEPGEPGPDMAQEAPQEDTSPMALMDKGENELTG 
SASEESQETTTSTIITTTVITTEQAPALCSVSFSNPEGYIDSSDYPLLPLNNFLECTYNVTVY 

TGYGVELQVKSVNLSDGELLSIRGVDGPTLTVLANQTLLVEGQVIRSPTNTISVYFRTFQDDG
LGTFQLHYQAFMLSCNFPRRPDSGDVTVMDLHSGGVAHFHCHLGYELQGAKMLTCINASKPHW 
SSQEPICSAPCGGAVHNATIGRVLSPSYPENTNGSQFCIWTIEAPEGQKLHLHFERLLLHDKD 
RMTVHSGQTNKSALLYDSLQTESVPFEGLLSEGNTIRIEFTSDQARAASTFNIRFEAFEKGHC 
YEPYIQNGNFTTSDPTYNIGTIVEFTCDPGHSLEQGPAIIECINVRDPYWNDTEPLCRAMCGG 
ELSAVAGVVLSPNWPEPYVEGEDCIWKIHVGEEKRIFLDIQFLNLSNSDILTIYDGDEVMPHI
LGQYLGNSGPQKLYSSTPDLTIQFHSDPAGLIFGKGQGFIMNYIEVSRNDSCSDLPEIQNGWK 
TTSHTELVRGARITYQCDPGYDIVGSDTLTCQWDLSWSSDPPFCEKIMYCTDPGEVDHSTRLI 
SDPVLLVGTTIQYTCNPGFVLEGSSLLTCYSRETGTPIWTSRLPHCVSEESLACDNPGLPENG 
YQILYKRLYLPGESLTFMCYEGFELMGEVTIRCILGQPSHWNGPLPVCKVNQDSFEHALEAEA 
AAETSLEGGNMALAIFIPVLIISLLLGGAYIYITRCRYYSNLRLPLMYSHPYSQITVETEFDN

PIYETGETREYEVSI

Signal peptide:

amino acids 1-28

Transmembrane domain:

amino acids 893-915

N-glycosylation sites.

amino acids 311-315, 328-332, 350-354, 435-439, 458-462, 474-478, 
514-518, 576-580, 618-622, 674-678, 742-746 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 188-192

N-myristoylation sites.

amino acids 23-29, 87-93, 146-152, 454-460, 475-481, 575-581,
629-635, 695-701, 723-729, 766-772, 877-883, 953-959

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 383-394 
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FIGURE 201

GATGGCTACGGCAGGGGGTGGCTCTGGGGCTGACCCGGGAAGTCGGGGTCTCCTTCGCCTTCT 
GTCTTTCTGCGTCCTACTAGCAGGTTTGTGCAGGGGAAACTCAGTGGAGAGGAAGATATATAT 
CCCCTTAAATAAAACAGCTCCCTGTGTTCGCCTGCTCAACGCCACTCATCAGATTGGCTGCCA 
GTCTTCAATTAGTGGAGACACAGGGGTTATCCACGTAGTAGAGAAAGAGGAGGACCTACAGTG 
GGTATTGACTGATGGCCCCAACCCCCCTTACATGGTTCTGCTGGAGAGCAAGCATTTTACCAG 
GGATTTAATGGAGAAGCTGAAAGGGAGAACCAGCCGAATTGCTGGTCTTGCAGTGTCCTTGAC 
CAAGCCCAGTCCTGCCTCAGGCTTCTCTCCTAGTGTACAGTGCCCAAATGATGGGTTTGGTGT 
TTACTCCAATTCCTATGGGCCAGAGTTTGCTCACTGCAGAGAAATACAGTGGAATTCGCTGGG 
CAATGGTTTGGCTTATGAAGACTTTAGTTTCCCCATCTTTCTTCTTGAAGATGAAAATGAAAC 
CAAAGTCATCAAGCAGTGCTATCAAGATCACAACCTGAGTCAGAATGGCTCAGCACCAACCTT 
CCCACTATGTGCCATGCAGCTCTTTTCACACATGCATGCTGTCATCAGCACTGCCACCTGCAT 
GCGGCGCAGCTCCATCCAAAGCACCTTCAGCATCAACCCAGAAATCGTCTGTGACCCCCTGTC 
TGATTACAATGTGTGGAGCATGCTAAAGCCTATAAATACAACTGGGACATTAAAGCCTGACGA 
CAGGGTTGTGGTTGCTGCCACCCGGCTGGATAGTCGTTCCTTTTTCTGGAATGTGGCCCCAGG 
GGCTGAAAGCGCAGTGGCTTCCTTTGTCACCCAGCTGGCTGCTGCTGAAGCTTTGCAAAAGGC 
ACCTGATGTGACCACCCTGCCCCGCAATGTCATGTTTGTCTTCTTTCAAGGGGAAACTTTTGA 
CTACATTGGCAGCTCGAGGATGGTCTACGATATGGAGAAGGGCAAGTTTCCCGTGCAGTTAGA
GAATGTTGACTCATTTGTGGAGCTGGGACAGGTGGCCTTAAGAACTTCATTAGAGCTTTGGAT 
GCACACAGATCCTGTTTCTCAGAAAAATGAGTCTGTACGGAACCAGGTGGAGGATCTCCTGGC 
CACATTGGAGAAGAGTGGTGCTGGTGTCCCTGCTGTCATCCTCAGGAGGCCAAATCAGTCCCA 
GCCTCTCCCACCATCTTCCCTGCAGCGATTTCTTCGAGCTCGAAACATCTCTGGCGTTGTTCT 
GGCTGACCACTCTGGTGCCTTCCATAACAAATATTACCAGAGTATTTACGACACTGCTGAGAA 
CATTAATGTGAGCTATCCCGAATGGCTGAGCCCTGAAGAGGACCTGAACTTTGTAACAGACAC 
TGCCAAGGCCCTGGCAGATGTGGCCACGGTGCTGGGACGTGCTCTGTATGAGCTTGCAGGAGG 
AACCAACTTCAGCGACACAGTTCAGGCTGATCCCCAAACGGTTACCCGCCTGCTCTATGGGTT 
CCTGATTAAAGCCAACAACTCATGGTTCCAGTCTATCCTCAGGCAGGACCTAAGGTCCTACTT 
GGGTGACGGGCCTCTTCAACATTACATCGCTGTCTCCAGCCCCACCAACAGCACTTATGTTGT 
ACAGTATGCCTTGGCAAATTTGACTGGCACAGTGGTCAACCTCACCCGAGAGCAGTGCCAGGA 
TCCAAGTAAAGTCCCAAGTGAAAACAAGGATCTGTATGAGTACTCATGGGTCCAGGGCCCTTT 
GCATTCTAATGAGACGGACCGACTCCCCCGGTGTGTGCGTTCTACTGCACGATTAGCCAGGGC 
CTTGTCTCCTGCCTTTGAACTGAGTCAGTGGAGCTCTACTGAATACTCTACATGGACTGAGAG 
CCGCTGGAAAGATATCCGTGCCCGGATATTTCTCATCGCCAGCAAAGAGCTTGAGTTGATCAC 
CCTGACAGTGGGCTTCGGCATCCTCATCTTCTCCCTCATCGTCACCTACTGCATCAATGCCAA 
AGCTGATGTCCTTTTCATTGCTCCCCGGGAGCCAGGAGCTGTGTCATACTGAGGAGGACCCCA 
GCTTTTCTTGCCAGNTCAGCAGTTCACTTCCTAGAGCATCTGTCCCACTGGGACACAACCACT 
AATTTGTCACTGGAACCTCCCTGGGCCTGTCTCAGATTGGGATTAACATAAAAGAGTGGAACT 
ATCCAAAAGAGACAGGGAGAAATAAATAAATTGCCTCCCTTCCTCCGCTCCCCTTTCCCATCA 
CCCCTTCCCCATTTCCTCTTCCTTCTCTACTCATGCCAGATTTTGGGATTACAAATAGAAGCT 
TCTTGCTCCTGTTTAACTCCCTAGTTACCCACCCTAATTTGCCCTTCAGGACCCTTCTACTTT
TTCCTTCCTGCCCTGTACCTCTCTCTGCTCCTCAGCCCCACCCCTGTACCCAGCCACCTTCCT 
GACTGGGAAGGACATAAAAGGTTTAATGTCAGGGTCAAACTACATTGAGCCCCTGAGGACAGG 
GGCATCTCTGGGCTGAGCCTACTGTCTCCTTCCCACTGTCCTTTCTCCAGGCCCTCAGATGGC 
ACATTAGGGTGGGCGTGCTGCGGGTGGGTATCCCACCTCCAGCCCACAGTGCTCAGTTGTACT 
TTTTATTAAGCTGTAATATCTATTTTTGTTTTTGTCTTTTTCCTTTATTCTTTTTGTAAATAT 

ATATATAATGAGTTTCATTAAAATAGATTATCCC 
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FIGURE 202 

MATAGGGSGADPGSRGLLRLLSFCVLLAGLCRGNSVERKIYIPLNKTAPCVRLLNATHQIGCQ 

SSISGDTGVIHVVEKEEDLQWVLTDGPNPPYMVLLESKHFTRDLMEKLKGRTSRIAGLAVSLT
KPSPASGFSPSVQCPNDGFGVYSNSYGPEFAHCREIQWNSLGNGLAYEDFSFPIFLLEDENET 
KVIKQCYQDHNLSQNGSAPTFPLCAMQLFSHMHAVISTATCMRRSSIQSTFSINPEIVCDPLS 
DYNVWSMLKPINTTGTLKPDDRVVVAATRLDSRSFFWNVAPGAESAVASFVTQLAAAEALQKA
PDVTTLPRNVMFVFFQGETFDYIGSSRMVYDMEKGKFPVQLENVDSFVELGQVALRTSLELWM 
HTDPVSQKNESVRNQVEDLLATLEKSGAGVPAVILRRPNQSQPLPPSSLQRFLRARNISGVVL
ADHSGAFHNKYYQSIYDTAENINVSYPEWLSPEEDLNFVTDTAKALADVATVLGRALYELAGG 
TNFSDTVQADPQTVTRLLYGFLIKANNSWFQSILRQDLRSYLGDGPLQHYIAVSSPTNTTYVV
QYALANLTGTVVNLTREQCQDPSKVPSENKDLYEYSWVQGPLHSNETDRLPRCVRSTARLARA
LSPAFELSQWSSTEYSTWTESRWKDIRARIFLIASKELELITLTVGFGILIFSLIVTYCINAK 

ADVLFIAPREPGAVSY 

Signal peptide: 

amino acids 1-33 

Transmembrane domain:

amino acids 671-692

N-glycosylation sites.

amino acids 45-49, 55-59, 187-191, 200-204, 204-208, 264-268,

387-391, 417-421, 435-439, 464-468, 506-510, 530-534, 562-566, 
573-577, 580-584, 612-616 

Glycosaminoglycan attachment site. 

amino acids 404-408 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 232-236 



N-myristoylation site. 

amino acids 5-11, 6-12, 9-15, 29-35, 61-67, 120-126, 146-152, 
168-174, 205-211, 294-300, 438-444, 446-452, 504-510, 576-582
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FIGURE 203

rrTArACCGAGCCCTGGGAGGCTACGGGCTCCCCCGGAAACCCTGCCAGGGGAGCCGGGTTTT 
GAGCTCAGGCGCCTCTAGCGGCGGCCCCCAGAAATCT 

A^TTTCACCCTGAAGGGGAGTTTGACTCGTATGAAGTC 

cggggagLgtgcagggtgtggtcagtcccgtgtcctacctactgcagttaaaaggcaagaa^ 

CACGTCCTCCATTTGTGGCCCAAGAGACTTCTGTTGCCCCGACATCTGCGCGTTTTCTCCTTC 
A^AGAACATGGGGAACTGCTGGAGGATCATCCTTACATACCAAAG 

?c2g^aaagagtctctggactctaaagctactataagcacatgcatggggggtctccg^ 

GTATTTAAC^ATTGATGCCAAACATTACCAAATTGAGCCCCTCAAGGCCTCTCCCAGTTTTGAA 

ca?g?otc?atctcctgaagaaagagcagtttgggaatcaggt 

GAAATAGAATGGCAGATGGCCCCTTATGAGAATAAGGCGAGGCTAAGGGACTTTCCTGGATCC 

?a^^ccStacttggaattgatcctactctttgatcaaagtaggtataggtttgt^ 
aa™aa£t^ 

^caagatgttcgtatgaggatacacttaaaggctcttgaagtatggacagattttaaca^ 

ATACGCGTTGGATATCCAGAGTTAGCTGAAGTTTTAGGCAGATTTGTAATATATAAAAAAAGT 

™aaI?gctcgcctgt^ 

GCTCTTG^ATGGTCGTTTGGAAAAGTGTGTTCTCTAGAATATGCTGGATCAGTGAG 

cSSacaaa^^^ 

ATGTCAC^ATGATGAACAATACTGCCAATGTAGGGGTAGGCTTAATTGCATCATGGGCTCAGGA 

cgcactggg???agcaattg^ 

TGTCTAAATAATATCCCAGGACTAGGTTATGTGCTTAAGAGA 

gaS^gg^tgtgactgtggttc^^^ 

AATTCTAAGTTGCAACCAGGTGCCAACTGTAGCATTGGACTTTGCTGTCATGATTGTCGGTTT 

^cc™?ggatacgtgtgtaggcaggaaggaaatgaatgtga 

GGGAATTCAAGTTCCTGCCC^AAATGACGTTTATAAGCAG 

ggc^?g^?caggaaggggtgcagatcca 

rATGCCATGGAGGCTCCTAGTGAGTGCTATGATGCAGTTAACTTAATAGGTGATCAATTTGGT 

aac?g5gagat?acaggaatt^ 

mgcta?agtgtataaatgttgaaaccatccctgatttgccagag^ 
mgggaatacct^ 

TTTAAAAAAAATTGCGTCAATAGCTCAGTCCTGCAGTTTGACTGTTTGCCTGAGAAATGCAA 
ArrrrrTGTGTTTGCAACAACAGAAAAAACTGCCACTGCATGTATGGGTGGGCACCTCCATTC 

?g£gaggaI^ 

ATTCCCTCGTCAATTTGGGTTGTGTCCATCATAATGTTTCGCCTTATTTTATTAATCCTT^ 

g?gSttttg^ 

SacSJccaaaLaaaaactgaacaggaagaatctaaaacaaa^ 
aaaacaaaaSggacaggaagaatctgaagcaaaaactggacaggaagaatctaaag^ 

A^TCGACAGGAAGAATCTAAAGCAAACATTGAAAGTAAACGACCCAAAGCAAAGAGTGTCAAG 
AAACAAAAAAAGTAACCGGGCAATCCATACTCATTCAGTAACACAGGCTCATTTATTTAACCA 
GCTAATCATTTATCCAAAGGCTTTCCATTCTTCTCCCAATATTTTTTTACTTTAATTTTTCCC 
ACAAGTTTTGATCAGCAAATAAACAGCATTCTTGTTTTGGAAACAAAAA 
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FIGURE 204 

MRSVQIFLSQCRLLLLLVPTMLLKSLGEDVIFHPEGEFDSYEVTIPEKLSFRGEVQGVVSPVS 
YLLQLKGKKHVLHLWPKRLLLPRHLRVFSFTEHGELLEDHPYIPKDCNYMGSVKESLDSKATI 

STCMGGLRGVFNIDAKHYQIEPLKASPSFEHWYLLKKEQFGNQVCGLSDDEIEWQMAPYENK
ARLRDFPGSYKHPKYLELILLFDQSRYRFVNNNLSQVIHDAILLTGIMDTYFQDVRMRIHLKA 
LEVWTDFNKIRVGYPELAEVLGRFVIYKKSVLNARLSSDWAHLYLQRKYNDALAWSFGKVCSL 
EYAGSVSTLLDTNILAPATWSAHELGHAVGMSHDEQYCQCRGRLNCIMGSGRTGFSNCSYISF 
FKHISSGATCLNNIPGLGYVLKRCGNKIVEDNEECDCGSTEECQKDRCCQSNCKLQPGANCSI 
GLCCHDCRFRPSGYVCRQEGNECDLAEYCDGNSSSCPNDVYKQDGTPCKYEGRCFRKGCRSRY 
MQCQSIFGPDAMEAPSECYDAVNLIGDQFGNCEITGIRNFKKCESANSICGRLQCINVETIPD 
LPEHTTIISTHLQAENLMCWGTGYHLSMKPMGIPDLGMINDGTSCGEGRVCFKKNCVNSSVLQ 
FDCLPEKCNTRGVCNNRKNCHCMYGWAPPFCEEVGYGGSIDSGPPGLLRGAIPSSIWWSIIM 
FRLILLILSWFVFFRQVIGNHLKPKQEKMPLSKAKTEQEESKTKTVQEESKTKTGQEESEAK 

TGQEESKAKTGQEESKANIESKRPKAKSVKKQKK 



Signal peptide: 

amino acids 1-27

Transmembrane domain:

amino acids 684-705

N-glycosylation sites.

amino acids 222-226, 372-376, 438-442, 473-477, 625-629

N-myristoylation sites.

amino acids 131-137, 168-174, 235-241, 319-325, 364-370, 436-442, 

472-478, 609-615, 642-648, 668-674, 676-680, 680-686, 749-755, 

758-764, 767-773 

Amidation site. 

amino acids 69-73

Disintegrins proteins 

amino acids 429-479 

EGF-like domain proteins 

amino acids 650-662 

Neutral zinc metallopeptidases, zinc-binding region proteins

amino acids 335-345 
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FIGURE 205 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGGGAAGGTTGAATGGGGTAGAAGGCCTG 

TTGTGGAGGGAAACCACCCATCCTCCTGCCTCCCACCACCACCATCATCCTGGCTGGACGGAG 

AGGGTGACGGGGGCTGGGAAGGGGCAGCTCATGTTCAGGTTTCCAGGAGGGGCTACCTGTTGA 

CTGTCTTTGCAGGAAGAAGAAAACACCTGAGTGACCAGATGTCCCAGCTCCAGGTGCCTTGCC 

AGATGGCCAGAACCACACCTCTTGAAGAGTGACAGTGCTGTGGAGCATGGTTTCTGCACACCT 

GGAATGACTGGAACCCCAAAGACTCAAGAAGGAGCTAAAGATCTTGAAGTAGACATGAATAAA 

ACAGAAGGCTGTGGACCACCTGTCGAGATGGAGAAGTCCTTCTGAGGCTATCCAAACACGGAC 

CAGGCCATGAGACCCCGATGACCATCCCTGAATTTTTTCGAGAGTCAGTCAACCGATTTGGAA 

CTTATCCAGCCCTCCCATCCAAGAATGGCAAAAAGTGGGAAATTCTGAATTTCAACCAGTACT 

ATGAGGCTTGTCGGAAGGCTGCAAAATCCTTGATCAAGCTGGGTTTGGAGCGTTTCCACGGAG 

TTGGTATCCTGGGGTTTAACTCTGCAGAGTGGTTTATCACTGCTGTTGGTGCCATCCTAGCCG 

GGGGTCTTTGTGTTGGTATTTATGCCACCAACTCTGCCGAGGCTTGTCAATATGTCATCACTC 

ATGCCAAAGTGAACATCTTGCTGGTTGAGAATGATCAACAGTTACAGAAAATCCTTTCGATTC 

CACAGAGCAGGCTAGAGCCCCTAAAAGCGATCATCCAGTACAGACTGCCAATGAAGAAGAACA 

ACAACTTGTACTCTTGGGATGATTTCATGGAACTTGGCAGAAGTATCCCTGACACCCAACTGG 

AGCAGGTCATCGAGAGCCAGAAGGCGAATCAATGCGCAGTGCTCATCTACACTTCAGGGACCA 

CAGGCATACCCAAGGGAGTGATGCTCAGTCATGACAACATCACGTGGATTGCAGGAGCAGTGA 

CAAAGGACTTTAAACTGACAGACAAGCATGAGACGGTGGTTAGCTACCTCCCACTCAGCCATA

TTGCAGCACAGATGATGGACATCTGGGTACCCATAAAGATTGGGGCGCTCACATACTTTGCTC 

AAGCAGATGCTCTCAAGGGCACCTTGGTAAGTACTCTAAAGGAGGTAAAACCTACTGTCTTCA 

TTGGAGTGCCTCAAATTTGGGAGAAGATACATGAGATGGTGAAGAAAAATAGTGCCAAGTCCA 

TGGGCTTGAAGAAGAAGGCATTCGTGTGGGCAAGAAACATTGGCTTCAAGGTCAACTCAAAAA 

AGATGTTGGGGAAATATAATACTCCCGTGAGCTACCGCATGGCTAAGACTCTCGTGTTCAGCA 

AAGTCAAGACATCCCTTGGCTTGGATCACTGTCACTCTTTTATCAGTGGGACTGCGCCCCTCA 

ACCAAGAGACTGCCGAGTTCTTTCTAAGCTTGGACATACCTATAGGCGAGTTGTATGGGTTGA 

GTGAGAGCTCGGGACCCCACACGATATCCAACCAGAATAACTACAGGCTTCTAAGCTGTGGCA 

AGATCTTGACTGGGTGTAAGAATATGCTGTTCCAGCAGAACAAGGATGGCATTGGGGAGATCT 

GCCTCTGGGGTAGGCACATCTTCATGGGCTATCTGGAAAGTGAGACTGAAAGTACAGAGGCCA 

TCGATGATGAAGGCTGGCTACACTCTGGGGATCTGGGCCAGCTGGACGGTCTGGGTTTCCTCT 

ATGTCACCGGCCACATCAAAGAAATCCTTATCACTGCTGGTGGTGAAAATGTGCCCCCCATTC 

CTGTTGAGACCTTGGTTAAGAAGAAGATCCCCATCATCAGTAACGCCATGTTAGTAGGAGATA 

AACTGAAGTTTCTGAGCATGTTGCTGACGCTGAAGTGTGAGATGAATCAGATGAGCGGAGAAC 

CTCTGGACAAGCTGAACTTCGAGGCCATCAACTTCTGTCGGGGTCTGGGCAGCCAGGCATCCA 

CCGTGACTGAGATTGTGAAGCAGCAAGACCCCCTGGTCTACAAGGCCATCCAGCAAGGCATCA 

ATGCTGTGAACCAGGAAGCCATGAACAATGCACAGAGGATTGAAAAGTGGGTCATCTTGGAGA 

AGGACTTTTCCATCTATGGTGGAGAGCTAGGTCCAATGATGAAACTTAAGAGACATTTTGTAG 

CCCAGAAATACAAAAAACAAATTGATCACATGTACCACT^CTGCTTTGATGGAGCTGCTCTC 

AGCTGTTCTGATGCCTTCAGCAGGAAGACCTCATTGCAATAAGTGAAATGCTGCTCTAGGTAG 

AAGCTCTCCCTGCTGTTTTTAAGAAGCCACATTCCTCATTGGTCAGTTTCTTGATTGTTCGTC 

TGTTGGAGAGGTGCTCCCTAGAAGAACCTGCCATACGTTTCAAAGCAATAAAATCACTGTATA 

TCTTTCTAAGGACCTTCAAGTCATGACTCCAGGGAAGCCTATTGGGAAGTCTACTAAAAACTG 

CCTGATTTACAAGAAAGACCTGAACTTGTGGGCTCCCATTTGATTTTTTTCTCCTCAGGGGAC 

TCAGACATTAGAAAGAAAAAGCCTCACAGATTTGAAGAACTGGACCCCCAAATCAACTCACCT 

GCCTGGAAGCAACTGGGAAACCCTTCCAATAAGTCCTGATAATAAAGCACTTCAGGGTCCCAA 

AAAAAAAAAA 
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206/550 
FIGURE 206

MTIPEFFRESVNRFGTYPALPSKNGKKWEILNFNQYYEACRKAAKSLIKLGLERFHGVGILGF
NSAEWFITAVGAILAGGLCVGIYATNSAEACQYVITHAKVNILLVENDQQLQKILSIPQSSLE
PLKAIIQYRLPMKKNNNLYSWDDFMELGRSIPDTQLEQVIESQKANQCAVLIYTSGTTGIPKG
VMLSHDNITWIAGAVTKDFKLTDKHETVVSYLPLSHIAAQMMDIWVPIKIGALTYFAQADALK
GTLVSTLKEVKPTVFIGVPQIWEKIHEMVKKNSAKSMGLKKKAFVWARNIGFKVNSKKMLGKY 
NTPVSYRMAKTLVFSKVKTSLGLDHCHSFISGTAPLNQETAEFFLSLDIPIGELYGLSESSGP 
HTISNQNNYRLLSCGKILTGCKNMLFQQNKDGIGEICLWGRHIFMGYLESETETTEAIDDEGW 
LHSGDLGQLDGLGFLYVTGHIKEILITAGGENVPPIPVETLVKKKIPIISNAMLVGDKLKFLS 
MLLTLKCEMNQMSGEPLDKLNFEAINFCRGLGSQASTVTEIVKQQDPLVYKAIQQGINAVNQE 
AMNNAQRIEKWVILEKDFSIYGGELGPMMKLKRHFVAQKYKKQIDHMYH 

Signal peptide: 

amino acids 1-22 
Transmembrane domain:
amino acids 65-86 
N-glycosylation site. 

amino acids 196-200 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 282-286 

Tyrosine kinase phosphorylation sites. 

amino acids 547-555, 608-616 
N-myristoylation sites. 

amino acids 15-21, 74-80, 80-86, 84-90, 185-191, 189-195, 
253-259, 337-343, 371-377, 448-454, 536-542 
Amidation site.
amino acids 24-28 

Putative AMP-binding domain signature. 

amino acids 177-189 

Putative AMP-binding domain proteins. 

amino acids 173-190 
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207/550 

FIGURE 207

rrraCGCGTCCGCCCACGCGTCCGCGGACGCGTGGGGCCAGATCGCGGCCGGCGCCAGCGCCA 
CCOTCCGOTCCACCCG^ 

ccgagga?caggSggggc^ 

^GGGAGCGCCCGAGCCGCCCGGCTGCCCGGAGCCCCATCG 

SgcaIccgcSgttcc^ 

gctcaIcagcccccggagaagacggcgcccccaacgcccgacccgcgtggccgtggcagcgc^ 
acgcgagccctctaScgaccgcag 

CMGACCCCCGGCGCGGGCCCATGGAGCCA^GCCTATAG 

Scagccgcg^gggcgcgcaggctgctcttcgcctt^ 
?ctgSacagcttcctgtgctgctgcgacgacctgggt 

rCTCCGGGCCGACGCCCAGCGAGCCCAGCGCTCCCAGCGCGCCCGCCGCCGCCGTGCCCGCCC 

Scgcc?c??cgg?^caI^ 

CCCTCATTGTGGGCGTGAAGAAGGGGGGCACCCGGGCCGTGCTGGAGTTTATCCGAGTACACC 

?ggI?g?g?gggccttgggcacggaaccccact 

a??ggtIcaggagcctgatgcccaggaccctcgagagccagatcacgctggagaaga 
gSactttgS^ 

?cg?gg??g?gcggaa?cctgtgacccgtgcca^ 

tt^agaccgtataatatcaaattttatgaaaccgttgggcaggacttcaggtgggaasmg 
cccacgaaaggaaagggctctcaagggctcttctgctcatctcttccgt 

ATCTTGGAACCAGGAAGCCCAGCTAAAGCCAAGAGACCAGAGAGTCCCTGCCACTAGTTTTCA 

?caSc?cttcaag^gttgatctgctcctggcacgtccagtaaattccag^ 
c?caggag^gcctt^ 

gSgc???gggtggtgggatgggagaccatcctcagccctcctcagac 
agagattgSaaagctgaaagcacctccggccactcctgggagacagaccctttggtgatg^ 

taaaccagtgacttcagagcctatggtctcaactgtgcttgaaaaacac 

aotctgattctccctgctccctgtggacaaaagcac^^ 

gSStIcgIgSttcatg^ 
ggc5ctctcg?gtccttaa 

gagggcaaggactacatggggcagaggcaaggcggtggtggagat^ 

TrrrAGAAGCTGGGGCAGAAAGATCACATGAGATCTGTGGGGACACCCTCTATCTGAAACAlA 
TTGATGGGGCATCAGATATC^ 

TTAGA^ACTGGGAGCATTCATATGTAAAAAATATGGATTTTCAGCTTCTTCTTAAAAAAA 
AAAAAAAAAA 
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208/550 
FIGURE 208 

MAYRVLGRAGPPQPRRARRLLFAFTLSLSCTYLCYSFLCCCDDLGRSRLLGAPRCLRGPSAGG 
QKLLQKSRPCDPSGPTPSEPSAPSAPAAAVPAPRLSGSNHSGSPKLGTKRLPQALIVGVKKGG 
TRAVLEFIRVHPDVRALGTEPHFFDRNYGRGLDWYRSLMPRTLESQITLEKTPSYFVTQEAPR 
RIFNMSRDTKLIWVRNPVTRAISDYTQTLSKKPDIPTFEGLSFRNRTLGLVDVSWNAIRIGM 
YVLHLESWLQYFPLAQIHFVSGERLITDPAGEMGRVQDFLGIKRFITDKHFYFNKTKGFPCLK 
KTESSLLPRCLGKSKGRTHVQIDPEVIDQLREFYRPYNIKFYETVGQDFRWE 

Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 102-106, 193-197, 235-239, 306-310 

Tyrosine kinase phosphorylation site. 

amino acids 296-305 

N-myristoylation sites. 

amino acids 51-57, 100-106, 121-127, 125-131 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 20-31 
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209/550 

FIGURE 209 

CTTTCCTTATCTGTGTGTACTCTTATCTCACTGTTCTATTTTTTCTCCTCATTTATATTAACT

CTTTCTTACCTTTTTTTCTGAACTTCTAGGCCTTCTCTTTCCAGAACTGGTGGAAGACAAATG 

AAACGGCCAAGATGGTAAGAAACAAGCCGCATTTCTCCTTGGGGAGACTGATAATTTAAAAGG 

TTTGTTGTGTCAGAAACATTCCCAGCTTCATCACCAACCCTTTCCTTCCACCTCTGCCCACTG 

GAGACCACTTACATCCCGAAGCGGACGCGGCAGCTGAAGTCAGGAAACCATGCATCACATTAG 

CAGGAGCCAACTGCAGACTTTAAACTCCGTTCAACATGTGGATGCGGCAGAGAAATGACCTGT 

CCAGACAAGCCGGGGCAGCTCATAAACTGGTTCATCTGCTCCCTGTGCGTCCCGCGGGTGCGT 

AAGCTCTGGAGCAGCCGGCGTCCAAGGACCCGGAGAAACCTTCTGCTGGGCACTGCGTGTGCC 

ATCTACTTGGGCTTCCTGGTGAGCCAGGTGGGGAGGGCCTCTCTCCAGCATGGACAGGCGGCT 

GAGAAGGGGCCACATCGCAGCCGCGACACCGCCGAGCCATCCTTCCCTGAGATACCCCTGGAT 

GGTACCCTGGCCCCTCCAGAGTCCCAGGGCAATGGGTCCACTCTGCAGCCCAATGTGGTGTAC 

ATTACCCTACGCTCCAAGCGCAGCAAGCCGGCCAATATCCGTGGCACCGTGAAGCCCAAGCGC 

AGGAAAAAGCATGCAGTGGCATCGGCTGCCCCAGGGCAGGAGGCTTTGGTCGGACCATCCCTT 

CAGCCGCAGGAAGCGGCAAGGGAAGCTGATGCTGTAGCACCTGGGTACGCTCAGGGAGCAAAC 

CTGGTTAAGATTGGAGAGCGACCCTGGAGGTTGGTGCGGGGTCCGGGAGTGCGAGCCGGGGGC 

CCAGACTTCCTGCAGCCCAGCTCCAGGGAGAGCAACATTAGGATCTACAGCGAGAGCGCCCCC 

TCCTGGCTGAGCAAAGATGACATCCGAAGAATGCGACTCTTGGCGGACAGCGCAGTGGCAGGG 

CTCCGGCCTGTGTCCTCTAGGAGCGGAGCCCGTTTGCTGGTGCTGGAGGGGGGCGCACCTGGC 

GCTGTGCTCCGCTGTGGCCCTAGCCCCTGTGGGCTTCTCAAGCAGCCCTTGGACATGAGTGAG 

GTGTTTGGCTTCCACCTAGACAGGATCCTGGGGCTCAACAGGACCCTGCCGTCTGTGAGCAGG 

AAAGCAGAGTTCATCCAAGATGGCCGCCCATGCCCCATCATTCTTTGGGATGCATCTTTATCT 

TCAGCAAGTAATGACACCCATTCTTCTGTTAAGCTCACCTGGGGAACTTATCAGCAGTTGCTG 

AAACAGAAATGCTGGCAGAATGGCCGAGTACCCAAGCCTGAATCAGGTTGTACTGAAATACAT 

CATCATGAGTGGTCCAAGATGGCACTCTTTGATTTTTTGTTACAGATTTATAATCGCTTAGAT 

ACAAATTGCTGTGGATTCAGACCTCGCAAGGAAGATGCCTGTGTACAGAATGGATTGAGGCCA 

AAATGTGATGACCAAGGTTCTGCGGCTCTAGCACACATTATCCAGCGAAAGCATGACCCAAGG 

CATTTGGTTTTTATAGACAACAAGGGTTTCTTTGACAGGAGTGAAGATAACTTAAACTTCAAA 

TTGTTAGAAGGCATCAAAGAGTTTCCAGCTTCTGCAGTTTCTGTTTTGAAGAGCCAGCACTTA 

CGGCAGAAACTTCTTCAGTCTCTGTTTCTTGATAAAGTGTATTGGGAAAGTCAAGGAGGTAGA 

CAAGGAATTGAAAAGCTTATCGATGTAATAGAACACAGAGCCAAAATTCTTATCACCTATATC 

AATGCACACGGGGTCAAAGTATTACCTATGAATGAATGACAAAAGAATCTTCTGGCTAGGGTG 

TTAGATATATTTATGCATTTTTGGTTTTGTTTTTAAATCAAGCACATCAACCTCAAGCCCGTT 

TAGCAATGAGGCAGTGTAGATGAATACGTAAAATAAATGACTTTAACCAAGTAGCTATAAAGG 

GACTTAGCACTGTATGCATACTTAAAAAGGTTTTGAAAAACAAACTACTTGAGAAATATTTGT 

TTATATTTTTCTCTAACATCATGCTATGTGTCAGTCTGAACATCTGACAACAGAAATTTCAGT 

TATTATTCTAGCTAAGTTTTGAAAACATTTGTCATGCTGTTTAATAGAAAACTGCAAACCAGA 

GATACTGACTCCATTAATAAACCATATTTTGTGCCGTTTTGACTGTTCTGACCAAATACTAAT 

GGGAACAATTCTTGACGTTTTTCTGTTGCTGATTGTTAACATAGAGCAGTCTCTACACTACCC 

TGAGGCAACTCTACATTGGAACACTGAGGCTTACAGCCTGCAAGAGCATCAGAGCTGACCATA 

CATTTAAACAGAAATGCTGGTTTATTTGCAAAATCACCAGTATATTTTCTATTGTGTCTATAA 

AAAATCAGTCATTTAAGTACAAGAATCATATTTTCCATTCCTTTTTAGAAATTTATTTTGTTG 

TCCCTATGGAAATCATTCACATCTGACAATTTATATGTTAAAGAGTTTTACTCTCTCTATTTT 

GGTCCAATTTGTATCTAGTGGCTGAGAAATTAAATAATTCTAAAGTATGAAGTTACCTATCTG 

AAAATGTACTTACAGAGTATCATTTTAAAATGGATGTCTCTTTAAAAATTTTGTTACTTTTAC 

CAACAATGTAATATAATTTATGTATATTTTATTAATAATAGTGAATTCCTTAAAATTTGTTCT 

ATGTACTTATATTTAATTTGATTTAATGGTTACTGCCCAGATATTGAGAAATGGTTCAAATAT 

TGAGTGTGTTTCAATAA 
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210/550 
FIGURE 210 

MTCPDKPGQLINWFICSLCVPRVRKLWSSRRPRTRRNLLLGTACAIYLGFLVSQVGRASLQHG 
QAAEKGPHRSRDTAEPSFPEIPLDGTLAPPESQGNGSTLQPNVVYITLRSKRSKPANIRGTVK
PKRRKKHAVASAAPGQEALVGPSLQPQEAAREADAVAPGYAQGANLVKIGERPWRLVRGPGVR 
AGGPDFLQPSSRESNIRIYSESAPSWLSKDDIRRMRLLADSAVAGLRPVSSRSGARLLVLEGG 
APGAVLRCGPSPCGLLKQPLDMSEVFAFHLDRILGLNRTLPSVSRKAEFIQDGRPCPIILWDA 
SLSSASNDTHSSVKLTWGTYQQLLKQKCWQNGRVPKPESGCTEIHHHEWSKMALFDFLLQIYN 
RLDTNCCGFRPRKEDACVQNGLRPKCDDQGSAALAHIIQRKHDPRHLVFIDNKGFFDRSEDNL 
NFKLLEGIKEFPASAVSVLKSQHLRQKLLQSLFLDKVYWESQGGRQGIEKLIDVIEHRAKILI 

TYINAHGVKVLPMNE 

Transmembrane domain: 

amino acids 40-56 

N-glycosylation sites. 

amino acids 98-102, 289-293, 322-326 

N-myristoylation sites.

amino acids 8-14, 41-47, 97-103, 187-193, 251-257, 252-258,
287-293, 484-490 
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211/550 

FIGURE 211

GTGGGGTGGTGAGCGCAGCGCCGAGGATGAGGAGGTGCAACAGCGGCTCCGGGCCGCCGCCGTCGCTGCTGCTGC 
TGCTGCTGTGGCTGCTCGCGGTTCCCGGCGCTAACGCGGCCCCGCGGTCGGCGCTCTATTCGCCTTCCGACCCGC 
TGACGCTGCTGCAGGCGGACACGGTGCGCGGCGCGGTGCTGGGCTCCCGCAGCGCCT 

CCTCCTGGTGCGGCCACTGCATCGCCTTCGCCCCGACGTGGARGGCGCTGGCCGAAGACGTCAAAGCCTGGAGGC 
rrrrCCTGTATCTCGCCGCCCTGGACTGTGCTGAGGAGACCAACAGTGCAGTCTGCAGAGACTTCAACATCCCTG 

acctgcagacgctgcgggaga^ 

ttgaaaagggaggctcctacctgggtagagaggtggctctggacctgtcccagcacaaaggcgtggcggtgcgca 

ggg^gScacagagS^ 

ggwgg?^gtctcccgagtcS 

SctcaccagggaggSS 

rat5ggcagatcgctccaagatctacatggctgacctggaatctgcactgcactacatcct 

^agg^?c?gctcctgg^gggcaS 

ctggccggc^^ 

cctacagtttctttaaaactgccctggacgacaggaaagagggtgccgttcttgcc^ 

gSgccaggggagtgagccgc^^ 
aggcagctcggcaaaatgtagaccactcac^^ 

acgtgcactacttcttcggctgccgagactgcgctagccacttcgagcagatggctgctgcctccatgcaccggg 

JggggactcccaacgccgctgtS 

ccag?SaggacSc?agttccc^ 

?gga?g?gc?cgtgtgg^ 

tggacttccctgcagctgggtcagctgcccggagggatgtgcagaatgtggcagccgccccagagctggcgatgg 
Jagccctggagctgg^ 

caggccaggagcctcctgagcacatggcaga 

tcagcaagcgagacacaggggctgcattgctggctga 

aggtcaggcgcgtgggccgcagctccaagcagctggtcga 

rrrrrCGAGGCCAGTGGCTGCAGGTGCTGGGAGGGGGCTTCTCTTACCT 
??GGCCACCCTGCAGCCTGAA 

cctgaSccS?Sctcc^cccaS 
SgSccaotgaagStcttgggS 

ttcccctgctg^cagggagggcagccccgggcagtgggcatagggcagctcagtccctggcctcttagcaccac 

ATTCCTGTTTTTCAGCTTATTTGAAGTCCTGCC 

ScSSgggg^aagtgaag^ 

TGGGGCTCCGCCCACCCTGCTCCCTTCCGGACAATGAAGAAGCCTTTGCACCCTGGGA 

TTCTGGAAGTCGTGCTGGTCTCCCAGGTGAGGCAAGCCATGGTTGCTGGGCTGTAGGGTGAGTGGCTTGCTTGGT 
GGGACCTCACGAGTTGGTGGCATGGGAAGGATGTGGG 

TGGCTGCTTTCACTTCCCCCCATTGAGCTCTGCTCCCTCTGAGCCTGGTCTTTTGTCCT 

c^SSgcS 

gccttgggaggtgtgtggaataaattatttttgttaaggca 
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212/550 

FIGURE 212 

MRRCNSGSGPPPSLLLLLLWLLAVPGANAAPRSALYSPSDPLTLLQADTVRGAVLGSRSAWAV 
EFFASWCGHCIAFAPTWKALAEDVKAWRPALYLAALDCAEETNSAVCRDFNIPGFPTVRFFKA 
FTKNGSGAVFPVAGADVQTLRERLIDALESHHDTWPPACPPLEPAKLEEIDGFFARNNEEYLA 
LIFEKGGSYLGREVALDLSQHKGVAVRRVLNTEANWRKFGVTDFPSCYLLFRNGSVSRVPVL 
MESRSFYTAYLQRLSGLTREAAQTTVAPTTANKIAPTVWKLADRSKIYMADLESALHYILRIE 
VGRFPVLEGQRLVALKKFVAVLAKYFPGRPLVQNFLHSVNEWLKRQKRNKIPYSFFKTALDDR 
KEGAVLAKKVNWIGCQGSEPHFRGFPCSLWVLFHFLTVQAARQNVDHSQEAAKAKEVLPAIRG 
YVHYFFGCRDCASHFEQMAAASMHRVGSPNAAVLWLWSSHNRVNARLAGAPSEDPQFPKVQWP 
PRELCSACHNERLDVPVWDVEATLNFLKAHFSPSNIILDFPAAGSAARRDVQNVAAAPELAMG 
ALELESRNSTLDPGKPEMMKSPTNTTPHVPAEGPEASRPPKLHPGLRAAPGQEPPEHMAELQR 
NEQEQPLGQWHLSKRDTGAALLAESRAEKNRLWGPLEVRRVGRSSKQLVDIPEGQLEARAGRG
RGQWLQVLGGGFSYLDISLCVGLYSLSFMGLLAMYTYFQAKIRALKGHAGHPAA 

Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 705-728 

N-glycosylation sites. 

amino acids 130-134, 243-247, 575-579 

Glycosaminoglycan attachment site. 

amino acids 6-10 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 644-648 
N-myristoylation sites.

amino acids 52-58, 56-62, 196-202, 381-387, 392-398, 448-454, 
468-474, 684-690, 702-708 

Cytochrome c family heme-binding site signature. 

amino acids 509-515 
Thioredoxin family proteins 

amino acids 62-78 
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213/550 

FIGURE 213 

GCACGAGGCCGACTTCCAGACCATCTACAACTGCACGGCCTGGAACAGCTTCGGCTCCGACAC 

TGAGATCATCCGGCTCAAGGAGCAAGGTTCGGAAATGAAGTCGGGAGCCGGGCTGGAAGCAGA 

GTCTGTGCCGATGGCCGTCATCATTGGGGTGGCCGTAGGAGCTGGTGTGGCCTTCCTCGTCCT 

TATGGCAACCATCGTGGCGTTCTGCTGTGCCCGTTCCCAGAGAAATCTCAAAGGTGTTGTGTC 

AGCCAAAAATGATATCCGAGTGGAAATTGTCCACAAGGAACCAGCCTCTGGTCGGGAGGGTGA 

GGAGCACTCCACCATCAAGCAGCTGATGATGGACCGGGGTGAATTCCAGCAAGACTCAGTCCT 

GAAACAGCTGGAGGTCCTCAAAGAAGAGGAGAAAGAGTTTCAGAACCTGAAGGACCCCACCAA 

TGGCTACTACAGCGTCAACACCTTCAAAGAGCACCACTCAACCCCGACCATCTCCCTCTCCAG 

CTGCCAGCCCGACCTGCGTCCTGCGGGTAAGCAGCGTGTGCCCACAGGCATGTCCTTCACCAA 

CATCTACAGCACCCTGAGCGGCCAGGGCCGCCTCTACGACTACGGGCAGCGGTTTGTGCTGGG 

CATGGGCAGCTCGTCCATCGAGCTTTGTGAGCGGGAGTTCCAGAGAGGCTCCCTCAGCGACAG 

CAGCTCCTTCCTGGACACGCAGTGTGACAGCAGCGTCAGCAGCAGCGGCAAGCAGGATGGCTA 

TGTGCAGTTCGACAAGGCCAGCAAGGCTTCTGCTTCCTCCTCCCACCACTCCCAGTCCTCGTC 

CCAGAACTCTGACCCCAGTCGACCCCTGCAGCGGCGGATGCAGACTCACGTCTAAGGATCACA 

CACCGCGGGTGGGGACGGGCCAGGGAAGAGGTCAGGGCACGTTCTGGTTGTCCAGGGACGAGG 

GGTACTTTGCAGAGGACACCAGAATTGGCCACTTCCAGGACAGCCTCCCAGCGCCTCTGCCAC 

TGCCTTCCTTCGAAGCTCTGATCAAGCACAAATCTGGGTCCCCAGGTGCTGTGTGCCAGAGGT 

GGGCGGGTGGGGAGACAGACAGAGGCTGCGGCTGAGTGCGCTGTGCTTAGTGCTGGACACCCG

TGTCCCCGGCCCTTTCCTGGAGGCCCCTCTACCACCTGCTCTGCCCACAGGCACAAGTGGCAG 

CTATAACTCTGCTTTCATGAAACTGCGGTCCACTCTCTGGTCTCTCTGTGGGCTCTACCCCTC 

ACTGACCACAAGCTCTACCTACCCCTGTGCCTGTGCTCCCATACAGCCCTGGGGAGAAGGGGA 

TGACGTCTTCCCAGCACTGAGCTGCCCCAGAAACCCCGGCTCCCCACTGCTGCTCATAGCCCA 

TACCCTGGAGGCTGACAAGCCAGAAATGGCCTTGGCTAAAGGAGCCTCTCTCTCACCAGGCTG 

GCCGGGAGCCCACCCCCAATTTGTTTGGTGTTTTGTGTCCATACTCTTGCAGTTCTGTCCTTG 

GACTTGATGCCGCTGAACTCTGCGGTGGGACCGGTCCCGTCAGAGCCTGGTGTACTGGGGGGA 

GGGAGGGAGGAGGGAGCCTGTGCTGACGGAGCACCTCGCCGGGTGTGCCCCTCCTGGGCTGTG 

TGACCCCAGCCTCCCCACCCACCTCCTGCTTTGTGTACTCCTCCCCTCCCCCTCAGCACAATC 

GGAGTTCATATAAGAAGTGCGGGAGCTTCTCTGGTCAGGGTTCTCTGAACACTTATGGAGAGA 

GTGCTTCCTGGGAAGTGTGGCGTTTGAAGGGGCTGGAGGGCAGGTCTTTAAGATGGCGAGACT 

GCCCTTCTCAGCTGATAAACACAAGAACGGCGATCCTGTCTTCAGTAAGGCTCCACGAGAAGA 

GAGGAAGTATATCTACACCTCAACCCTCCTAGTCACCACCTGAAATAAATGTTAGGGAAAAAAAA 
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214/550 

FIGURE 214 

MAVIIGVAVGAGVAFLVLMATIVAFCCARSQRNLKGVVSAKNDIRVEIVHKEPASGREGEEHS 
TIKQLMMDRGEFQQDSVLKQLEVLKEEEKEFQNLKDPTNGYYSVNTFKEHHSTPTISLSSCQP 
DLRPAGKQRVPTGMSFTNIYSTLSGQGRLYDYGQRFVLGMGSSSIELCEREFQRGSLSDSSSF 
LDTQCDSSVSSSGKQDGYVQFDKASKASASSSHHSQSSSQNSDPSRPLQRRMQTHV 

Signal peptide: 

amino acids 1-28 

Glycosaminoglycan attachment site. 

amino acids 150-154 

N-myristoylation sites. 

amino acids 6-12, 10-16, 36-42, 139-145, 165-171

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 114-125 



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215/550 

FIGURE 215 

CAGCCTTCCTCCCCCAGCCTGAGTGACTACTCTATTCCTTGGTCCCTGCTATTGTCGGGGACG 

ATTGCATGGGCTACGCCAGGAAAGTAGGCTGGGTGACCGCAGGCCTGGTGATTGGGGCTGGCG

CCTGCTATTGCATTTATAGACTGACTAGGGGAAGAAAACAGAACAAGGAAAAAATGGCTGAGG 

GTGGATCTGGGGATGTGGATGATGCTGGGGACTGTTCTGGGGCCAGGTATAATGACTGGTCTG 

ATGATGATGATGACAGCAATGAGAGCAAGAGTATAGTATGGTACCCACCTTGGGCTCGGATTG 

GGACTGAAGCTGGAACCAGAGCTAGGGCCAGGGCAAGGGCCAGGGCTACCCGGGCACGTCGGG 

CTGTCCAGAAACGGGCTTCCCCCAATTCAGATGATACCGTTTTGTCCCCTCAAGAGCTACAAA 

AGGTTCTTTGCTTGGTTGAGATGTCTGAAAAGCCTTATATTCTTGAAGCAGCTTTAATTGCTC 

TGGGTAACAATGCTGCTTATGCATTTAACAGAGATATTATTCGTGATCTGGGTGGTCTCCCAA 

TTGTCGCAAAGATTCTCAATACTCGGGATCCCATAGTTAAGGAAAAGGCTTTAATTGTCCTGA 

ATAACTTGAGTGTGAATGCTGAAAATCAGCGCAGGCTTAAAGTATACATGAATCAAGTGTGTG 

ATGACACAATCACTTCTCGCTTGAACTCATCTGTGCAGCTTGCTGGACTGAGATTGCTTACAA 

ATATGACTGTTACTAATGAGTATCAGCACATGCTTGCTAATTCCATTTCTGACTTTTTTCGTT 

TATTTTCAGCGGGAAATGAAGAAACCAAACTTCAGGTTCTGAAACTCCTTTTGAATTTGGCTG 

AAAATCCAGCCATGACTAGGGAACTGCTCAGGGCCCAAGTACCATCTTCACTGGGCTCCCTCT 

TTAATAAGAAGGAGAACAAAGAAGTTATTCTTAAACTTCTGGTCATATTTGAGAACATAAATG 

ATAATTTCAAATGGGAAGAAAATGAACCTACTCAGAATCAATTCGGTGAAGGTTCACTTTTTT 

TCTTTTTAAAAGAATTTCAAGTGTGTGCTGATAAGGTTCTGGGAATAGAAAGTCACCATGATT 

TTTTGGTGAAAGTAAAAGTTGGAAAATTCATGGCCAAACTTGCTGAACATATGTTCCCAAAGA

GCCAGGAAraACACCTTGATTTTGTAATTTAGAAGCAACACACATTGTAAACTATTCATTTTC 
TCCACCTTGTTTATATGGTAAAGGAATCCTTTCAGCTGCCAGTTTTGAATAATGAATATCATA 
TTGTATCATCAATGCTGATATTTAACTGAGTTGGTCTTTAGGTTTAAGATGGATAAATGAATA 
TCACTACTTGTTCTGAAAACATGTTTGTTGCTTTTTATCTCGCTGCCTAGATTGAAATATTTT 
GCTATTTCTTCTGCATAAGTGACAGTGAACCAATTCATCATGAGTAAGCTCCCTTCTGTCATT 
TTCATTGATTTAATTTGTGTATCATCAATAAAATTGTATGTTAATGCTGGAAAGA 
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216/550 

FIGURE 216 

MGYARKVGWVTAGLVIGAGACYCIYRLTRGRKQNKEKMAEGGSGDVDDAGDCSGARYNDWSDD 
DDDSNESKSIVWYPPWARIGTEAGTRARARARARATRARRAVQKRASPNSDDTVLSPQELQKV 
LCLVEMSEKPYILEAALIALGNNAAYAFNRDIIRDLGGLPIVAKILNTRDPIVKEKALIVLNN 
LSVNAENQRRLKVYMNQVCDDTITSRLNSSVQLAGLRLLTNMTVTNEYQHMLANSISDFFRLF 
SAGNEETKLQVLKLLLNLAENPAMTRELLRAQVPSSLGSLFNKKENKEVILKLLVIFENINDN 
FKWEENEPTQNQFGEGSLFFFLKEFQVCADKVLGIESHHDFLVKVKVGKFMAKLAEHMFPKSQE 

Signal peptide: 

amino acids 1-20 

N-glycosylation sites.

amino acids 68-72, 189-193, 217-221, 230-234 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 107-111 

N-myristoylation sites. 

amino acids 13-19, 17-23, 19-25, 54-60, 83-89, 147-153, 255-261, 
290-296 

Amidation site.

amino acids 29-33 
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FIGURE 217 

GAGACACAAAGGCAGGCGGGATGCGGGAGCAGGCAAAGGGAAAGCGAAAGCCGCGCGCCCGGC 

CGGTGACTGGGTGAAGGCGCCGCGCAGCTTTCCCGACGCCGGCTGTACCCGGACCTCCTGGTC 

GAGCCTGGCGCGCCGCAGCCATGGCCATCGCTCAACTGGCCACGGAGTACGTGTTCTCGGATT

TCTTGCTGAAGGAGCCCACGGAGCCCAAGTTCAAGGGGCTGCGACTGGAGCTGGCTGTGGACA 

AGATGGTCACGTGCATTGCGGTGGGGCTGCCCCTGCTGCTCATCTCGCTGGCCTTCGCGCAGG 

AGATCTCGATTGGTACACAGATAAGCTGTTTCTCTCCAAGTTCTTTCTCCTGGCGTCAGGCTG 

CCTTTGTGGATTCATATTGCTGGGCGGCTGTTCAGCAGAAGAACTCACTGCAGAGCGAGTCTG 

GAAACCTCCCACTGTGGCTGCATAAGTTTTTCCCCTACATCCTGCTGCTCTTTGCGATCCTCC 

TGTACCTGCCCCCGCTGTTCTGGCGTTTCGCAGCTGCTCCTCATATTTGCTCAGACTTGAAGT 

TTATCATGGAAGAACTTGACAAAGTTTACAACCGTGCAATTAAGGCTGCAAAGAGTGCGCGTG 

ACCTTGACATGAGAGATGGAGCCTGCTCAGTTCCAGGTGTTACCGAGAACTTAGGGCAAAGTT 

TGTGGGAGGTATCTGAAAGCCACTTCAAGTACCCAATTGTGGAGCAGTACTTGAAGACAAAGA 

AAAATTCTAATAATTTAATCATCAAGTACATTAGCTGCCGCCTGCTGACACTCATCATTATAC 

TGTTAGCGTGTATCTACCTGGGCTATTACTTCAGGCTCTCCTCACTCTCAGACGAGTTTGTGT 

GCAGCATCAAATCAGGGATCCTGAGAAACGACAGCACCGTGCCCGATCAGTTTCAGTGCAAAC 

TCATTGCCGTGGGCATCTTCCAGTTGCTCAGTGTCATTAACCTTGTGGTTTATGTCCTGCTGG 

CTCCCGTGGTTGTCTACACGCTGTTTGTTCCATTCCGACAGAAGACAGATGTTCTCAAAGTGT 

ACGAAATCCTCCCCACTTTTGATGTTCTGCATTTCAAATCTGAAGGGTACAACGATTTGAGCC 

TCTACAATCTCTTCTTGGAGGAAAATATAAGTGAGGTCAAGTCATACAAGTGTCTTAAGGTAC 

TGGAGAATATTAAGAGCAGTGGTCAGGGGATCGACCCAATGCTACTCCTGACAAACCTTGGCA 

TGATCAAGATGGATGTTGTTGATGGCAAAACTCCCATGTCTGCAGAGATGAGAGAGGAGCAGG 

GGAACCAGACGGCAGAGCTCCAAGGTATGAACATAGACAGTGAAACTAAAGCAAATAATGGAG 

AGAAGAATGCCCGACAGAGACTTCTGGATTCTTCTTGCTGATGATTTTTTTTCCTTGAGCTGT 

AAATCTGTGACTTCTGCGACATGGGATTTAATTTGGCTAAAGCACCCCTGTTGGTTTCACAGC 

TGGTTTGCAATAAATGGTTCTTGGTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

jyyy^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 218 

MAIAQLATEYVFSDFLLKEPTEPKFKGLRLELAVDKMVTCIAVGLPLLLISLAFAQEISIGTQ 
ISCFSPSSFSWRQAAFVDSYCWAAVQQKNSLQSESGNLPLWLHKFFPYILLLFAILLYLPPLF 
WRFAAAPHICSDLKFIMEELDKVYNRAIKAAKSARDLDMRDGACSVPGVTENLGQSLWEVSES 
HFKYPIVEQYLKTKKNSNNLIIKYISCRLLTLIIILLACIYLGYYFSLSSLSDEFVCSIKSGI 
LRNDSTVPDQFQCKLIAVGIFQLLSVINLVVYVLLAPVVVYTLFVPFRQKTDVLKVYEILPTF
DVLHFKSEGYNDLSLYNLFLEENISEVKSYKCLKVLENIKSSGQGIDPMLLLTNLGMIKMDVV 
DGKTPMSAEMREEQGNQTAELQGMNIDSETKANNGEKNARQRLLDSSC 

Transmembrane domains: 

amino acids 37-55, 108-126, 216-232, 273-290 

N-glycosylation sites. 

amino acids 255-259, 338-342, 394-398 

Glycosaminoglycan attachment site. 

amino acids 357-361 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 203-207
N-myristoylation sites.

amino acids 61-67, 174-180, 251-257, 393-399

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 218-229 
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FIGURE 219 

CTGTGAGTGACACACGCTGAGTGGGGTGAAGGGAAATGCTGGTGAATTTCATTTTGAGGTGTG 

GGTTGCTGTTAGTCACTCTGTCTCTTGCCATTGCCAAGCACAAGCAATCTTCCTTCACCAAAA 

GTTGTTACCCAAGGGGAACATTGTCCCAAGCTGTTGACGCTCTCTATATCAAAGCAGCATGGC 

TCAAAGCAACGATTCCAGAAGACCGCATAAAAAATATACGATTATTAAAAAAGAAAACAAAAA 

AGCAGTTTATGAAAAACTGTCAATTTCAAGAACAGCTTCTGTCCTTCTTCATGGAAGACGTTT 

TTGGTCAACTGCAATTGCAAGGCTGCAAGAAAATACGCTTTGTGGAGGACTTTCATAGCCTTA 

GGCAGAAATTGAGCCACTGTATTTCCTGTGCTTCATCAGCTAGAGAGATGAAATCCATTACCA 

GGATGAAAAGAATATTTTATAGGATTGGAAACAAAGGAATCTACAAAGCCATCAGTGAACTGG 

ATATTCTTCTTTCCTGGATTAAAAAATTATTGGAAAGCAGTCAGTAAACCAAAGCCAAGTACA 

TTGATTTTACAGTTATTTTGAAATACAATAAGAACTGCTAGAAATATGTTTATAACAGTCTAT 

TTCTTTTAAAAACTTTTTAACATAATACTGACGGCATGTTAGGTGATTCAGAATAGACAAGAA 

GGATTTAGTAAATTAACGTTTTGGATATAAGTTGTCACTAATTTGCACATTTTCTGTGTTTTC 

AAATAATGTTTCCATTCTGAACATGTTTTGTCATTCACAAGTACATTGTGTCAACTTAATTTA 

AAGTATGTAACCTGAATTAACTCGTGTAATATTTGTGTGTGGAGTGGGATGTGGGGGGTGGAG 

GGGGAATGACAGATTTCTGGAATGCAATGTAATGTTACTGAGACTTAAATAGATGTTATGTAT 

ATGATTGTCTGTTTAAGTGTTTGAAAATTGTTAATTATGCCCAGTGTGAACTTAGTACTTAAC 

ACATTTTGATTTTAATTAAATAAATTGGGTTTCCTTCTCAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAA
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FIGURE 220 

MLVNFILRCGLLLVTLSLAIAKHKQSSFTKSCYPRGTLSQAVDALYIKAAWLKATIPEDRIKN 
IRLLKKKTKKQFMKNCQFQEQLLSFFMEDVFGQLQLQGCKKIRFVEDFHSLRQKLSHCISCAS 
SAREMKSITRMKRIFYRIGNKGIYKAISELDILLSWIKKLLESSQ 

Signal sequence: 

amino acids 1-21 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 68-71 

N-myristoylation site. 

amino acids 148-153 

Interleukin-10 proteins.

amino acids 58-94, 74-102, 128-170 
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FIGURE 221 

GACCACGGCCCTGCGCCCCAGCCAGGCCTGAGGACATGAGGCGGCCGGCGGCGGTGCCGCTCC 

TGCTGCTGCTGTGTTTTGGGTCTCAGAGGGCCAAGGCAGCAACAGCCTGTGGTCGCCCCAGGA 

TGCTGAACCGAATGGTGGGCGGGCAGGACACGCAGGAGGGCGAGTGGCCCTGGCAAGTCAGCA 

TCCAGCGCAACGGAAGCCACTTCTGCGGGGGCAGCCTCATCGCGGAGCAGTGGGTCCTGACGG 

CTGCGCACTGCTTCCGCAACACCTCTGAGACGTCCCTGTACCAGGTCCTGCTGGGGGCAAGGC 

AGCTAGTGCAGCCGGGACCACACGCTATGTATGCCCGGGTGAGGCAGGTGGAGAGCAACCCCC 

TGTACCAGGGCACGGCCTCCAGCGCTGACGTGGCCCTGGTGGAGCTGGAGGCACCAGTGCCCT 

TCACCAATTACATCCTCCCCGTGTGCCTGCCTGACCCCTCGGTGATCTTTGAGACGGGCATGA 

ACTGCTGGGTCACTGGCTGGGGCAGCCCCAGTGAGGAAGACCTCCTGCCCGAACCGCGGATCC 

TGCAGAAACTCGCTGTGCCCATCATCGACACACCCAAGTGCAACCTGCTCTACAGCAAAGACA 

CCGAGTTTGGCTACCAACCCAAAACGATCAAGAATGACATGCTGTGCGCCGGCTTCGAGGAGG 

GCAAGAAGGATGCCTGCAAGGGCGACTCGGGCGGCCCCCTGGTGTGCCTCGTGGGTCAGTCGT 

GGCTGCAGGCGGGGGTGATCAGCTGGGGTGAGGGCTGTGCCCGCCAGAACCGCCCAGGTGTCT 

ACATCCGTGTCACCGCCCACCACAACTGGATCCATCGGATCATCCCCAAACTGCAGTTCCAGC 

CAGCGAGGTTGGGCGGCCAGAAGTGAGACCCCCGGGGCCAGGAGCCCCTTGAGCAGAGCTCTG 

CACCCAGCCTGCCCGCCCACACCATCCTGCTGGTCCTCCCAGCGCTGCTGTTGCACCTGTGAG 

CCCCACCAGACTCATTTGTAAATAGCGCTCCTTCCTCCCCTCTCAAATACCCTTATTTTATTT 

ATGTTTCTCCCAATAAAAACCCAGCCTGTGTGCCAGCTGAAAAAAAAAAAAAAAAAAA 
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FIGURE 222 

MRRPAAVPLLLLLCFGSQRAKAATACGRPRMLNRMVGGQDTQEGEWPWQVSIQRNGSHFCGGS 
LIAEQWVLTAAHCFRNTSETSLYQVLLGARQLVQPGPHAMYARVRQVESNPLYQGTASSADVA 
LVELEAPVPFTNYILPVCLPDPSVIFETGMNCWVTGWGSPSEEDLLPEPRILQKLAVPIIDTP 
KCNLLYSKDTEFGYQPKTIKNDMLCAGFEEGKKDACKGDSGGPLVCLVGQSWLQAGVISWGEG 
CARQNRPGVYIRVTAHHNWIHRIIPKLQFQPARLGGQK 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation sites. 

amino acids 55-58, 79-82 

Casein kinase II phosphorylation sites.

amino acids 121-124, 165-168, 167-170, 248-251 

Tyrosine kinase phosphorylation sites. 

amino acids 78-86, 197-203 

N-myristoylation sites.

amino acids 16-21, 37-42, 56-61, 62-67, 118-123 

Amidation site. 

amino acids 219-222 

Serine proteases, trypsin family, histidine active site. 

amino acids 71-76 
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FIGURE 223 

CAAGATGTGGACAGCTCTTGTGCTCATTTGGATTTTCTCCTTGTCCTTATCTGAAAGCCATGC 

GGCATCCAACGATCCACGCAACTTTGTCCCTAACAAAATGTGGAAGGGATTAGTCAAGAGGAA 

TGCATCTGTGGAAACAGTTGATAATAAAACGTCTGAGGATGTAACCATGGCAGCAGCTTCTCC 

TGTCACATTGACCAAAGGGACTTCGGCAGCCCACCTCAACTCTATGGAAGTCACAACAGAGGA 

CACAAGCAGGACAGATGTGAGTGAACCAGCAACTTCAGGAGTTGCAGCTGATGGTGTGACCTC 

CATTGCTCCCACGGCTGTGGCCTCCAGTACGACTGCGGCCTCCATTACGACTGCGGCCTCCAG 

TATGACTGTGGCCTCCAGTGCTCCCACGACTGCAGCCTCCAGTACAACTGTGGCCTCCATTGC 

TCCCACGACTGCAGCCTCCAGTATGACTGCGGCCTCCAGCACTCCCATGACACTTGCACTCCC 

CGCGCCCACGTCCACTTCCACAGGGCGGACCCCGTCCACTACCGCCACTGGGCATCCATCTCT 

CAGCACAGCCCTCGCACAAGTGCCAAAGAGCAGCGCGTTGCCAAGAACAGCAACCCTGGCCAC 

ATTGGCCACACGTGCTCAGACTGTAGCGACCACAGCAAACACAAGCAGCCCCATGAGCACTCG 

TCCAAGTCCTTCCAAGCACATGCCCAGTGACACCGCGGCAAGCCCTGTACCCCCTATGCGTCC 

CCAAGCACAAGGTCCCATTAGCCAGGTGTCAGTGGACCAGCCTGTGGTTAACACAACAAATAA 

ATCCACACCCATGCCCTCAAACACAACCCCAGAGCCCGCCCCCACCCCCACAGTGGTGACCAC 

CACCAAGGCACAAGCCAGGGAGCCAACTGCCAGCCCAGTGCCAGTACCTCACACCAGCCCAAT 

CCCTGAGATGGAGGCCATGTCCCCCACGACACAGCCAAGCCCCATGCCATATACCCAGAGGGC 

CGCTGGGCCAGGCACATCCCAGGCACCGGAGCAGGTAGAGACTGAAGCCACACCAGGTACTGA 

TTCCACTGGGCCAACACCCAGGAGCTCAGGGGGCACTAAGATGCCAGCCACGGACTCGTGCCA 

GCCCAGCACCCAAGGCCAGTACATGGTGGTCACCACTGAGCCCCTCACCCAGGCCGTGGTAGA

CAAAACTCTCCTTCTGGTGGTGCTGTTACTCGGGGTGACCGTTTTCATCACAGTCTTGGTTTT 
GTTTGCCCTGCAGGCCTATGAGAGCTACAAGAAGAAGGACTACACCCAGGTGGACTACTTAAT 
CAACGGGATGTATGCGGACTCAGAAATGTGAGGGGGGCGGGGGCCTGGCGGGAGGCCTGGCCC
CTTCCTCGTCCTTTCCTTTTGCCTTTGAGACCAAACCAAGTGCTTCCAAATTCTTTTGGTGCA 
ATTGAGGAGATATGCCAGATGCTTAAACACATTTAATTGCTGTCAGATTAATTCCATGATCAC 
TAAAGAGTTGCTGCTTTTTTCATATTTATTTTTGTAAATGATTCTGTGCCCAGGAGCAGCTGG 
GGGTTCCACCTCAGGGTGGGGCGGGCAGGACCCCGTCTCCCCAGGTGTCGGAGCCTGACCTGA 

ATTAAAGTACTGACTGCTCGCCA 
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FIGURE 224 

MWTALVLIWIFSLSLSESHAASNDPRNFVPNKMWKGLVKRNASVETVDNKTSEDVTMAAASPV 
TLTKGTSAAHLNSMEVTTEDTSRTDVSEPATSGVAADGVTSIAPTAVASSTTAASITTAASSM 
TVASSAPTTAASSTTVASIAPTTAASSMTAASSTPMTLALPAPTSTSTGRTPSTTATGHPSLS 
TALAQVPKSSALPRTATLATLATRAQTVATTANTSSPMSTRPSPSKHMPSDTAASPVPPMRPQ 
AQGPISQVSVDQPVVNTTNKSTPMPSNTTPEPAPTPTVVTTTKAQAREPTASPVPVPHTSPIP
EMEAMSPTTQPSPMPYTQRAAGPGTSQAPEQVETEATPGTDSTGPTPRSSGGTKMPATDSCQP 
STQGQYMVVTTEPLTQAVVDKTLLLVVLLLGVTLFITVLVLFALQAYESYKKKDYTQVDYLIN

GMYADSEM 

Signal peptide: 

amino acids 1-20 

Transmembrane domain:

amino acids 396-420 

N-glycosylation sites. 

amino acids 41-44, 49-52, 222-225, 268-271, 271-274 
Casein kinase II phosphorylation sites. 

amino acids 14-17, 51-54, 80-83, 85-88, 280-283, 434-437 

N-myristoylation sites. 

amino acids 68-73, 354-359 

Aldo/keto reductase family putative active site signature. 

amino acids 195-210
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FIGURE 225 

GGAAGGCGCTCAAGGTGCGCGGCCCGGGGCGCGCTACTGGGGGCGCCCTCCGCGGTGGGCAGC 

GCGCCAGGGATCGGCCTGGGCAGCCGCGGGGCGCGCGAAGGCTGCGCTTTCCCTACGGCCCCC 

CTCGCTTCCTCCGGCACGGCGGCAACGGAGATTTCCTCTCGGGGAAACTACGCGGATCCTTTT 

CGGGGATCCTCGCCCCGCCCCAGTTCTCCGCCCCCTCCCCTTTGCTGGGGCGCCTGGGCTGGC 

CCGCGCAGGGGAGGAGGCTCTGGCAGCCTGGGCAGGGAGGCGGCGGGGGGCCGCGGAGCCGCT 

GGCCATCGATTCTCCCCGCCATGTGACGCCGTCCTTAGCCCTGCGACCCCCAGCGCGTCCCGG 

GCCTGCGCCTCCGCCCCGCCGCGCAGCGCACGATGCTTCTGCCGGGACGCGCACGCCAACCGC

CGACGCCCCAGCCCGTGCAGCATCCCGGCCTCCGCCGGCAGGTAGAGCCGCCGGGGCAGCTCC 

TGCGCCTCTTCTACTGCACTGTCCTGGTCTGCTCCAAAGAGATCTCAGCGCTCACCGACTTCT 

CTGGTTACCTAACCAAACTCCTGCAAAACCACACCACCTATGCCTGTGATGGGGACTATTTGA 

ATCTACAGTGCCCTCGGCATTCTACGATAAGTGTCCAATCGGCATTTTATGGGCAAGATTACC 

AAATGTGTAGTTCCCAGAAGCCTGCCTCCCAGAGGGAAGACAGCTTAACCTGTGTGGCAGCCA 

CCACCTTCCAGAAGGTGCTGGACGAATGCCAGAACCAGCGGGCCTGCCACCTCCTGGTCAATA 

GCCGTGTTTTTGGACCTGACCTTTGTCCAGGAAGCAGTAAATACCTCCTGGTCTCCTTTAAAT 

GCCAACCTAATGAATTAAAAAACAAAACCGTGTGTGAAGACCAGGAGCTGAAACTGCACTGCC 

ATGAATCCAAGTTCCTCAACATCTACTCTGCGACCTACGGCAGGAGGACCCAGGAAAGGGACA 

TCTGCTCCTCCAAGGCAGAGCGGCTCCCCCCTTTCGATTGCTTGTCTTACTCAGCTTTGCAAG 

TCCTATCCCGAAGGTGCTATGGGAAGCAGAGATGCAAAATCATCGTCAACAATCACCATTTTG 

GAAGCCCCTGTTTGCCAGGCGTGAAAAAATACCTCACTGTGACCTACGCATGTGTTCCCAAGA 

ACATACTCACAGCGATTGATCCAGCCATTGCTAATCTAAAACCTTCTTTGAAGCAGAAAGATG 

GTGAATATGGTATAAACTTCGACCCAAGCGGATCGAAGGTTCTGAGGAAAGATGGAATTCTTG 

TTAGCAACTCTCTGGCAGCCTTTGCTTACATTAGAGCCCACCCAGAGAGAGCTGCCCTGCTGT

TCGTGTCCAGTGTCTGCATCGGCCTGGCCCTCACACTGTGCGCCCTGGTCATCAGAGAGTCCT 

GTGCCAAGGACTTCCGCGACTTGCAGCTGGGGAGGGAGCAGCTGGTGCCAGGAAGTGACAAGG 

TCGAGGAGGACAGCGAGGATGAAGAAGAGGAGGAGGACCCCTCTGAGTCTGATTTCCCAGGGG 

AACTGTCGGGGTTCTGTAGGACTTCATATCCTATATACAGTTCCATAGAAGCTGCAGAGCTCG

CAGAAAGGATTGAGCGCAGGGAGCAAATCATTCAGGAAATATGGATGAACAGTGGTTTGGACA 

CCTCGCTCCCAAGAAACATGGGCCAGTTCTACT^AAACCACATGCATCTTGATGCGATCGCA 

CTTTCTGAAGAAGGAAGGATCCCAAATGCCCCTCCAGTTCTGGTTCACCTGTACCTTCTATGA 

AGGAGAATTCGTCATGTCATTCAACACTCGTGAGGCCAGGAAGCTATTAAAGGGATGTTTCAA 

GCTGTTTCTAGCACATTCCAAAATAAATGAGGAGGGAGGAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAA 
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FIGURE 226 

MLLPGRARQPPTPQPVQHPGLRRQVEPPGQLLRLFYCTVLVCSKEISALTDFSGYLTKLLQNH 
TTYACDGDYLNLQCPRHSTISVQSAFYGQDYQMCSSQKPASQREDSLTCVAATTFQKVLDECQ 
NQRACHLLVNSRVFGPDLCPGSSKYLLVSFKCQPNELKNKTVCEDQELKLHCHESKFLNIYSA 

TYGRRTQERDICSSKAERLPPFDCLSYSALQVLSRRCYGKQRCKIIVNNHHFGSPCLPGVKKY
LTVTYACVPKNILTAIDPAIANLKPSLKQKDGEYGINFDPSGSKVLRKDGILVSNSLAAFAYI 
RAHPERAALLFVSSVCIGLALTLCALVIRESCAKDFRDLQLGREQLVPGSDKVEEDSEDEEEE 
EDPSESDFPGELSGFCRTSYPIYSSIEAAELAERIERREQIIQEIWMNSGLDTSLPRNMGQFY 

Transmembrane domains:

amino acids 32-49, 322-343 

N-glycosylation sites. 

amino acids 62-66, 165-169 

Tyrosine kinase phosphorylation site. 

amino acids 280-287 

N-myristoylation sites.

amino acids 302-308, 333-339, 428-434 



Amidation site. 

amino acids 191-195 
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FIGURE 227 

GGCACGAGGTGGAAGGGCTTTTACAAACAGATTGCTGGCCCCACCCCCCAGAATTTCTCATCA 
GGAGTGGGCAAGACCAATCATTTGCATTTCTGACAAGTTCCCAGGAGCTGCAGCTGCTGGCCC 
TGGAACCACACTTTGAGAACCACTGCTTTAGACCAAACACCAAAGGAAGATGCAGCCACCCTC 
CTTTACATGTCACAACGCTCAGGGTCCATGAGTACCTCAGGCTGTCCAGCTGAGCTCCACCTG 
CAGCAGCCGAGATTCCCGACTCGCTCCACCATTGGGGGCTAGGAGTGAAGCGTGTCACCATGG
TCAGCTCATGGCCAGCCAGGAAAGCCTCTCTGCTGTGCGTCTGTGCAGTTCTTGTTCTTCCCT 
GGAGGACTCTTGGATCGCCTGTGATCTTGGCCAGGAGACCAGGTGCCTGGGTCCCTTCCTGGA 
AGGGGACAAGTTACACACCCCAGCCCCATTTTCCCACCAACTTCTACATGCCTTGGGAGAACC 
TTCTACATGTTGGCTGCCCCCTTCCCCTATTTCAGCAGTGCCCAGTCCTGCTTATAAACCTGA 
GGCCTGCTCCCCATACCTTCCCTGTGCAAGTGCCAGCCGTTATTCCAGGCAGCCCAATGTTGT 
TGAGGCCAGATGGATTCCTGGAAGCAGCTGGCCCATGGATGI^GTCATCACAGTATTCTAGA 
AACAGAGAAGAGGTCTTAACCTAATGCGCATAGAGAAATTGTTCTCATTGTAAACATACCCCT 
GTCCTTAGCTGATCTAGGTGGAAGCCCAGCTTCATGTGCTAGGGGGCATGATAATGATAATAA 

AGGAATTGTATCTAGGACTAA 
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FIGURE 228 

MVSSWPARKASLLCVCAVLVLPWRTLGSPVILARRPGAWVPSWKGTSYTPQPHFPTNFYMPWE 
NLLHVGCPLPLFQQCPVLLINLRPAPHTFPVQVPAVIPGSPMLLRPDGFLEAAGPWM 

Signal peptide: 

amino acids 1-27 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 8-12 
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FIGURE 229 

GGGAAGGGATGCAAGGAAGCCCTCCGGCGCTGCGCTCCGAGGCGGGAGACAGCGTCCCGCTGA 

AAATGTGTGTCTGACATGCAAGCTCAGTGGGGCAGAGACCCGTGGATTGCTGTGCCCTGCCCT 

CCGGACCTGGATCATGAAGGTGTTGGGAAGAAGCTTCTTCTGGGTGCTGTTTCCCGTCCTTCC 

CTGGGCGGTGCAGGCTGTGGAGCACGAGGAGGTGGCGCAGCGTGTGATCAAACTGCACCGCGG 

GCGAGGGGTGGCTGCCATGCAGAGCCGGCAGTGGGTCCGGGACAGCTGCAGGAAGCTCTCAGG 

GCTTCTCCGCCAGAAGAATGCAGTTCTGAACAAACTGAAAACTGCAATTGGAGCAGTGGAGAA 

AGACGTGGGCCTGTCGGATGAAGAGAAACTGTTTCAGGTGCACACGTTTGAAATTTTCCAGAA 

AGAGCTGAATGAAAGTGAAAATTCCGTTTTCCAAGCTGTCTACGGACTGCAGAGAGCCCTGCA 

GGGGGATTACAAAGATGTCGTGAACATGAAGGAGAGCAGCCGGCAGCGCCTGGAGGCCCTGAG 

AGAGGCTGCAATAAAGGAAGAAACAGAATATATGGAACTTCTGGCAGCAGAAAAACATCAAGT 

TGAAGCCCTTAAAAATATGCAACATCAAAACCAAAGTTTATCCATGCTTGACGAGATTCTTGA 

AGATGTAAGAAAGGCAGCGGATCGTCTGGAGGAAGAGATAGAGGAACATGCTTTTGACGACAA 

TAAATCAGTCAAGGGGGTCAATTTTGAGGCAGTTCTGAGGGTGGAGGAAGAAGAGGCCAATTC 

TAAGCAAAATATAACAAAACGAGAAGTGGAGGATGACTTGGGTCTTAGCATGCTGATTGACTC 

CCAGAACAACCAGTATATTTTGACCAAGCCCAGAGATTCAACCATCCCACGTGCAGATCACCA 

CTTTATAAAGGACATTGTTACCATAGGAATGCTGTCCTTGCCTTGTGGCTGGCTATGTACAGC 

CATAGGATTGCCTACAATGTTTGGTTATATTATTTGTGGTGTACTTCTGGGACCTTCAGGACT 

AAATAGTATTAAGTCTATTGTGCAAGTGGAGACATTAGGAGAATTTGGGGTGTTTTTTACTCT 

TTTTCTTGTTGGCTTAGAATTTTCTCCAGAAAAGCTAAGAAAGGTGTGGAAGATTTCCTTACA 

AGGGCCGTGTTACATGACACTGTTAATGATTGCATTTGGCTTGCTGTGGGGGCATCTCTTGCG 

GATCAAACCCACGCAGAGCGTCTTCATTTCCACGTGTCTGTCCTTGTCAAGCACACCCCTCGT 

GTCCAGGTTCCTCATGGGCAGTGCTCGGGGTGACAAAGAAGGCGACATTGACTACAGCACCGT 

GCTCCTCGGCATGCTGGTGACGCAGGACGTGCAGCTCGGGCTCTTCATGGCCGTCATGCCGAC 

TCTCATACAGGCGGGCGCCAGTGCATCTTCTAGCATTGTCGTGGAAGTTCTCCGAATCCTGGT 

TTTGATTGGTCAGATTCTTTTTTCACTAGCGGCGGTTTTTCTTTTATGTCTTGTTATAAAGAA 

GTATCTCATTGGACCCTATTATCGGAAGCTGCACATGGAAAGCAAGGGGAACAAAGAAATCCT 

GATCTTGGGAATATCTGCCTTTATCTTCTTAATGTTAACGGTCACGGAGCTGCTGGACGTCTC 

CATGGAGCTGGGCTGTTTCCTGGCTGGAGCGCTCGTCTCCTCTCAGGGCCCCGTGGTCACCGA 

GGAGATCGCCACCTCCATCGAACCCATCCGCGACTTCCTGGCCATCGTTTTCTTCGCCTCCAT 

AGGGCTCCACGTGTTCCCCACGTTTGTGGCGTACGAGCTCACGGTGCTGGTGTTCCTCACCTT 

GTCAGTGGTGGTGATGAAGTTTCTCCTGGCGGCGCTGGTCCTGTCTCTCATTCTGCCGAGGAG 

CAGCCAGTACATCAAGTGGATCGTCTCTGCGGGGCTTGCCCAGGTCAGCGAGTTTTCCTTTGT 

CCTGGGGAGCCGGGCGCGAAGAGCGGGCGTCATCTCTCGGGAGGTGTACCTCCTTATACTGAG 

TGTGACCACGCTCAGCCTCTTGCTCGCCCCGGTGCTGTGGAGAGCTGCAATCACGAGGTGTGT 

GCCCAGACCGGAGAGACGGTCCAGCCTCTGATGGCTCGGAGATGATGGACCGTGGAAGGGAAG 

CGTCTGTGGGGAGTGAGCGCTTAGATGGCCAGCAGCTGCTCCTTCTGGGAAGCTCGCACCTTG 

GCAACAGAACAGCCCTCTAGCAGAGCGTCAGTGCAGTCGTGTTATCCCGGCTTTTACAGAATA 

TTCTTGTCCTATTTTAGAATTTTCCGGAGTAGTTTATTTGCAGTCTGTTGATTATGTGCAGTA 

GACCCGGGACACTGCGTTTTACCGATCACCTTGAATGTGGTGCCTGGATGTGCCTTTTTTTTT 

TTTCCCTGAAATTATTATTAATTTTCTATTGTGAGTTCATCAGTTCATAGTTTTTTTAGTAAA 

GAAGCAAAATTAAAAGGCTTTTAAAAATGTACAACTTCAGAATTATAATCTGTTAGTCAAATA 

TTTGTTATTAAACATTTCTGTAATATGAAGTTGTAATCCTGGCCGTGAGCTTGGAAGCTTACT 

TTTGATTCTTAAAGCCTATGTTTTCTAAAATGAGACAAATACGGATGTCTATTTGCCTTTTAT 

TGTAACTTTTAAATGAAATAATTTCATGTCAATTTCTATTAGATATATCACTTAAAATATTTG 

GTTTTAAATCACAAGAATATGTATTCTTTAATAAAGATAATTTATGATCATGGTAAAAAAAAAAA 
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FIGURE 230 

MKVLGRSFFWVLFPVLPWAVQAVEHEEVAQRVIKLHRGRGVAAMQSRQWVRDSCRKLSGLLRQ 
KNAVLNKLKTAIGAVEKDVGLSDEEKLFQVHTFEIFQKELNESENSVFQAVYGLQRALQGDYK 
DVVNMKESSRQRLEALREAAIKEETEYMELLAAEKHQVEALKNMQHQNQSLSMLDEILEDVRK 
AADRLEEEIEEHAFDDNKSVKGVNFEAVLRVEEEEANSKQNITKREVEDDLGLSMLIDSQNNQ 
YILTKPRDSTIPRADHHFIKDIVTIGMLSLPCGWLCTAIGLPTMFGYIICGVLLGPSGLNSIK 
SIVQVETLGEFGVFFTLFLVGLEFSPEKLRKVWKISLQGPCYMTLLMIAFGLLWGHLLRIKPT 
QSVFISTCLSLSSTPLVSRFLMGSARGDKEGDIDYSTVLLGMLVTQDVQLGLFMAVMPTLIQA 
GASASSSIVVEVLRILVLIGQILFSLAAVFLLCLVIKKYLIGPYYRKLHMESKGNKEILILGI
SAFIFLMLTVTELLDVSMELGCFLAGALVSSQGPVVTEEIATSIEPIRDFLAIVFFASIGLHV

FPTFVAYELTVLVFLTLSVVVMKFLLAALVLSLILPRSSQYIKWIVSAGLAQVSEFSFVLGSR
ARRAGVISREVYLLILSVTTLSLLLAPVLWRAAITRCVPRPERRSSL 

Signal peptide: 

amino acids 1-22 

Transmembrane domains:

amino acids 282-304, 322-337, 354-370, 379-395, 445-474, 501-520, 
576-598, 641-660 

N-glycosylation sites. 

amino acids 104-108, 174-178, 206-210, 230-234 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 55-59, 673-677 

Tyrosine kinase phosphorylation site. 

amino acids 407-414 

N-myristoylation sites. 

amino acids 116-122, 327-333, 366-372, 401-407, 419-425, 429-435, 
442-448, 525-531, 530-536 



Cell attachment sequence. 

amino acids 404-407 
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FIGURE 231 

GAGAAAAACAACAGGAAGCAGCTTACAAACTCGGTGAACAACTGAGGGAACCAAACCAGAGAC 

GCGCTGAACAGAGAGAATCAGGCTCAAAGCAAGTGGAAGTGGGCAGAGATTCCACCAGGACTG
GTGCAAGGCGCAGAGCCAGCCAGATTTGAGAAGAAGGCAAAAAGATGCTGGGGAGCAGAGCTG 
TAATGCTGCTGTTGCTGCTGCCCTGGACAGCTCAGGGCAGAGCTGTGCCTGGGGGCAGCAGCC 
CTGCCTGGACTCAGTGCCAGCAGCTTTCACAGAAGCTCTGCACACTGGCCTGGAGTGCACATC 
CACTAGTGGGACACATGGATCTAAGAGAAGAGGGAGATGAAGAGACTACAAATGATGTTCCCC 
ATATCCAGTGTGGAGATGGCTGTGACCCCCAAGGACTCAGGGACAACAGTCAGTTCTGCTTGC 
AAAGGATCCACCAGGGTCTGATTTTTTATGAGAAGCTGCTAGGATCGGATATTTTCACAGGGG 
AGCCTTCTCTGCTCCCTGATAGCCCTGTGGGCCAGCTTCATGCCTCCCTACTGGGCCTCAGCC 
AACTCCTGCAGCCTGAGGGTCACCACTGGGAGACTCAGCAGATTCCAAGCCTCAGTCCCAGCC 
AGCCATGGCAGCGTCTCCTTCTCCGCTTCAAAATCCTTCGCAGCCTCCAGGCCTTTGTGGCTG 
TAGCCGCCCGGGTCTTTGCCCATGGAGCAGCAACCCTGAGTCCCTAAAGGCAGCAGCTCAAGG 
ATGGCACTCAGATCTCCATGGCCCAGCAAGGCCAAGATAAATCTACCACCCCAGGCACCTGTG 
AGCCAACAGGTTAATTAGTCCATTAATTTTAGTGGGACCTGCATATGTTGAAAATTACCAATA 
CTGACTGACATGTGATGCTGACCTATGATAAGGTTGAGTATTTATTAGATGGGAAGGGAAATT 
TGGGGATTATTTATCCTCCTGGGGACAGTTTGGGGAGGATTATTTATTGTATTTATATTGAAT 
TATGTACTTTTTTCAATAAAGTCTTATTTTTGTGGCTAAAAAAAAAAAAA 
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FIGURE 232 

MLGSRAVMLLLLLPWTAQGRAVPGGSSPAWTQCQQLSQKLCTLAWSAHPLVGHMDLREEGDEE 
TTNDVPHIQCGDGCDPQGLRDNSQFCLQRIHQGLIFYEKLLGSDIFTGEPSLLPDSPVGQLHA 
SLLGLSQLLQPEGHHWETQQIPSLSPSQPWQRLLLRFKILRSLQAFVAVAARVFAHGAATLSP 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Casein kinase II phosphorylation site. 

amino acids 64-67 

N-myristoylation sites. 

amino acids 25-30, 81-86, 122-127 
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FIGURE 233 

rA^GGOTCCAGC^ 

ScacStcScSS^ 

CTGTAAAAGAGGAGCCCTGGATGGGAACTCCCCTCCTCAACGCCAGGATGTAG 

?gSSSSaga?5g?Scctggg 



aSgCGATATGAAGCCT^^ 

CCTGGGGCCAGTGATCCGGGCTGAGGTGGGTGACACCATTCAGGTGGTCTTCT^ 

rrrrrTrrTRCTGTGCAGGGCTGGTGCCTTGGGTGCAGATGGCAAGCAGAAAGGGGTGGATAAAG^i i^i l 
TCTCTTCACTGTGTTGGATGAG^CAAGAG 



n^rrr arAArrRGTCTGAGAAGGACAGTTATGGTTACATTTTCCTGAGCAACAAGGATGGGCTCCTGGGTTCGAG 



AAATAAGTCTTGGTATTTGGAGGAAAATGTGGCAACCCATGGGTCCCAGGATCCAGG^ 
TGAAACTTTCTTGGAGAGCAATAAAATGCATGCAATCAATGGGAAACTCTATGCCAACCTTAGG^ 



ArrTATTGAAATTTCTAGAAATGTATCCTTCTCACAAAGTAGAGACCAAGAGAAAAACTCATTGATTGGGl ^ 
ACTTC^TTTC^AGGACTCAG^AAATTTCACTTTGAACTGAGG^ 

taaaggctaagaatataggcttgatgggaaattgaaggtaggctgagtattgggaat 
Sot?Sgacagcaaaccctatccattaaagtacttgttagaccaaaaaaaaaaam 
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FIGURE 234 

MWAMESGHLLWALLFMQSLWPQLTDGATRVYYLGIRDVQWNYAPKGRNVITNQPLDSDIVASS 
FLKSDKNRIGGTYKKTIYKEYKDDSYTDEVAQPAWLGFLGPVLQAEVGDVILIHLKNFATRPY 
TIHPHGVFYEKDSEGSLYPDGSSGPLKADDSVPPGGSHIYNWTIPEGHAPTDADPACLTWIYH 
SHVDAPRDIATGLIGPLITCKRGALDGNSPPQRQDVDHDFFLLFSVVDENLSWHLNENIATYC 
SDPASVDKEDETFQESNRMHAINGFVFGNLPELNMCAQKRVAWHLFGMGNEIDVHTAFFHGQM 
LTTRGHHTDVANIFPATFVTAEMVPWEPGTWLISCQVNSHFRDGMQALYKVKSCSMAPPVDLL
TGKVRQYFIEAHEIQWDYGPMGHDGSTGKNLREPGSISDKFFQKSSSRIGGTYWKVRYEAFQD 
ETFQEKMHLEEDRHLGILGPVIRAEVGDTIQVVFYNRASQPFSMQPHGVFYEKDYEGTVYNDG 
SSYPGLVAKPFEKVTYRWTVPPHAGPTAQDPACLTWMYFSAADPIRDTNSGLVGPLLVCRAGA 
LGADGKQKGVDKEFFLLFTVLDENKSWYSNANQAAAMLDFRLLSEDIEGFQDSNRMHAINGFL 
FSNLPRLDMCKGDTVAWHLLGLGTETDVHGVMFQGNTVQLQGMRKGAAMLFPHTFVMAIMQPD 
NLGTFEIYCQAGSHREAGMRAIYNVSQCPGHQATPRQRYQAARIYYIMAEEVEWDYCPDRSWE 
REWHNQSEKDSYGYIFLSNKDGLLGSRYKKAVFREYTDGTFRIPRPRTGPEEHLGILGPLIKG 
EVGDILTWFKNNASRPYSVHAHGVLESTTVWPLAAEPGEWTYQWNIPERSGPGPNDSACVS 
WIYYSAVDPIKDMYSGLVGPLAICQKGILEPHGGRSDMDREFALLFLIFDENKSWYLEENVAT 
HGSQDPGSINLQDETFLESNKMHAINGKLYANLRGLTMYQGERVAWYMLAMGQDVDLHTIHFH 
AESFLYRNGENYRADWDLFPGTFEWEMVASNPGTWLMHCHVTDHVHAGMETLFTVFSRTEH 
LSPLTVITKETEKVPPRDIEEGNVKMLGMQIPIKNVEMLASVLVAISVTLLLVVLALGGVVWY 
QHRQRKLRRNRRSILDDSFKLLSFKQ



Signal peptide: 

amino acids 1-21 
Transmembrane domain: 
amino acids 1109-1130 
N-glycosylation sites.

amino acids 167-171, 239-243, 591-595, 717-721, 761-765, 832-836, 
876-880, 934-938 

Glycosaminoglycan attachment site. 

amino acids 871-875 

Tyrosine kinase phosphorylation sites. 

amino acids 82-90, 137-145, 494-502, 513-521 
N-myristoylation sites. 

amino acids 212-218, 313-319, 498-504, 566-572, 672-678, 778-784, 
843-849 

Multicopper oxidases signature 1. 

amino acids 344-365, 696-717, 1043-1064 
Multicopper oxidases signature 2. 

amino acids 1048-1060 
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FIGURE 235 

GGAAAGAGTGCTGGTACTACAACCAGGAAGTGACAGATAATGTGCTTTAAACTACATTAGAAAAGCTTCTCATAG 
CAAAACTGAGAGATTGAAGCAGTGATTATTTTTACATAGTTGTCATTAAATATTTGGAGCTCTGCTGTGCATAGA 
GATGGCAACATACTTAGAATACACAGCTTTCTGGGCCAGAAATTGATCTTCTGACTTTTGAGCCTTATCTGATTA 
CTGCTTGGTTCATCTTTATTTTGTTAAACTACTCTGTAGGCTGAAAGGGAGAGACTCTCCTTGGTTTGCAGAGCC 
TGACTAGACAGGAATTCTGGCAACTGCTCCAGCAGAACTATGGCACTGAGCTAGGTTTAAATGCTGAGGAGATGG 
AAAACTTGTCACTGTCGATTGAGGATGTGCAGCCAAGAAGTCCAGGAAGAAGCAGCTTGGATGACTCTGGGGAGA 
GAGATGAAAAATTATCCAAGTCAATCAGTTTTACCAGTGAATCAATTAGTCGGGTTTCAGAAACAGAGTCATTCG 
ATGGAAATTCATCAAAAGGAGGATTAGGCAAAGAGGAGTCCCAAAATGAGAAACAGACCAAAAAGAGTCTCTTAC 
CAACTTTGGAAAAGAAGTTAACTAGAGTGCCATCAAAGTCACTGGACTTGAATAAAAATGAATATCTTTCTCTGG 
ACAAAAGCAGCACTTCAGATTCTGTTGATGAAGAAAATGTTCCTGAGAAAGATCTTCATGGAAGACTTTTTATCA 
ACCGTATTTTTCATATCAGTGCTGACAGAATGTTTGAATTGCTCTTTACCAGTTCACGCTTTATGCAGAAATTTG 
CCAGTTCTAGAAATATAATAGATGTAGTATCTACCCCTTGGACTGCAGAACTTGGAGGTGATCAGCTGAGAACGA 
TGACCTACACTATAGTCCTTAATAGTCCACTTACTGGAAAATGCACTGCTGCCACTGAAAAGCAGACACTGTATA 
AAGAAAGTCGGGAAGCACGATTTTATTTGGTAGATTCAGAAGTACTGACACATGATGTCCCCTACCATGATTACT 
TCTATACCGTGAACAGATACTGTATCATCCGATCTTCAAAACAGAAATGCAGGCTAAGAGTTTCCACAGATTTGA 
AATACAGAAAACAGCCATGGGGCCTTGTCAAATCTTTAATTGAAAAGAATTCCTGGAGTTCTTTGGAGGACTATT 
TCAAACAGCTTGAATCAGATTTGTTAATTGAAGAATCTGTATTAAATCAGGCCATTGAAGACCCTGGAAAACTTA 
CTGGCCTACGAAGGAGAAGGCGAACCTTCAACCGAACAGCAGAAACAGTTCCTAAACTTTCCTCTCAGCATTCCT 
CTGGAGATGTGGGCTTAGGTGCCAAAGGGGATATTACAGGAAAGAAAAAGGAAATGGAAAACTATAACGTCACTC 
TTATTGTGGTAATGAGTATTTTTGTGTTGTTATTAGTTTTGTTGAATGTGACACTGTTTCTGAAGCTGTCAAAGA 
TAGAACATGCTGCTCAGTCCTTTTACCGTCTCCGCCTCCAAGAAGAGAAATCTTTAAATTTAGCCTCTGATATGG 
TGTCAAGAGCAGAAACTATTCAGAAGAATAAAGATCAGGCCCATCGTTTAAAGGGAGTGCTCCGAGACTCCATAG 
TGATGCTTGAACAGCTGAAGAGCTCACTCATTATGCTTCAGAAAACGTTTGATCTACTAAATAAGAATAAGACTG 
GCATGGCTGTTGAAAGCTAGTGATCTGAAGGACTAAAACCGCAGAGATACTTGGAACTTAAAGAAAATACCTGGA 
AGAAAACCAGACGAATGAAGGATTTTGGCATAGAACATTTCTATGTTTTTTCATTATTGAGATTTCTAATATGAA 
CATTTCTTTCAGTAACATTTATTTGATAATTAGTTTCTGCTGGCCTTAATAATCCATCCTTTCACTTCTTATAGA 
TATTTTTAAGCTGTGAATTTCTTCAGTGAACCATGAAATATATTATAGAACTGAATTTCTCTGATACAAAAAGAA 
AATGACACACCCTGAATTGAGTGGTATGGTCTCATTTCTACAGTGAAGTCTGATGCTTTGTTAGCACAGAATCCG 
TACATGTCCAATAGGTCGCTTTTGTAACTGAGATAAGACCAAGAGGATAAACAGGACAATATAAGAAGAAACCTC 
TATGTCATTACTGATTTTAAAGGTTCTGTTTTCAGGCATATAACATTTCCAGGTTTGTGTACTGTAAAGATTATA 
ATGTCTTCATTTATTTAGCATGCAAATTTAATAGTCAAACTTTTTGAATCTGCATGTTGATGATGATTATCAGAA 
AGGGTCTTCTGCCATGCTGTATCTTTATGAAAGAAATAGTTGTTTTTTCTTAAGGTAACTATCAGAGGTGGGATT 
ATCTTGCCTCCTCACTTAGAATACCAACAGTCAAAAGGAAGAACCATCCTCTGAGTTTTAAAAACCAGAAGGTTA
TGTTAAAATCTGGGCATTTAGTGACAGATCAAATGCATACTTGAACTAAGATTGGCTTCAGCTTAGCAGTCTTTC 
ATGGTGGAAGTGACACATCTGGTTGAAAATAATTTGTGTATTTTCAGTAACCATGTATGGCTTCCTTCTTTATGT 
ATGTGTGTGACTTGTTTTAATTGGTAAGTTATAAGCCAGACATAGATTTTAGCTCTTTAATAAAAACTTCAGGGG 
CACGTATGTCCCAGTACAAGTGTACTGACTATCAAGTTTTAACTCAGATGCAAGCTTTGGCTCTTTCATAAAAAG 
TTTTTATGCATATGTGTCTCCATACAAGTGGCTCATTAAAATAAGAACTTTGTAAACTGACTTAAAATCAGATAT 
TTTTTCAAGAGTTAGGGAAAGTTGAAGTGTTTTACTGTTTTGTCTCTTGAGCCCTTTCTCTGGGGAAAAAATACA 
TATCCATCTATCTATCTATATATAAACTGTGTATACATTCTTACTGTTTGAACAACTATTGCCTTTAATTAAATG 
TTTCATTTTTCTCCAGAGTCCCCAAAGCCACATGGCATTATTATAGTCATTTTTGAGATGCCTGTAGAGAATGAA 
AGTATTGACTCCGTTAGAGGGAAAATGGGTTTCTCTGGGTGAATTCCAACGAAGCATACCTAGGGGTAACAGTGA 
ACCTACCTGGGTTTGTTTTGTTTTGGTAAGGATTTATGTAGTGTCTGGCTGTAAGCAAGAATGAGTGGATTATAA 
ACTTGAAGATTTCTCTGTTAAAGTCACAAAAATGATCGACAAACAATATTTTTGTGATGTTTATTTAAACGTTGT 
ATTTTATAACATACTTCAAGGAAGAGTATCGAAGTAAGTTGCTTTATAAATTAAGACTAAATTCGTATGGATGCA 
GAATTCAATTAATAAAATTTGAGCCTGTTACGTAAATTGAATATT r AATAAAATTGAAAATTTCAAAA 
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FIGURE 236 

MENLSLSIEDVQPRSPGRSSLDDSGERDEKLSKSISFTSESISRVSETESFDGNSSKGGLGKE 
ESQNEKQTKKSLLPTLEKKLTRVPSKSLDLNKNEYLSLDKSSTSDSVDEENVPEKDLHGRLFI 
NRIFHISADRMFELLFTSSRFMQKFASSRNIIDVVSTPWTAELGGDQLRTMTYTIVLNSPLTG
KCTAATEKQTLYKESREARFYLVDSEVLTHDVPYHDYFYTVNRYCIIRSSKQKCRLRVSTDLK 
YRKQPWGLVKSLIEKNSWSSLEDYFKQLESDLLIEESVLNQAIEDPGKLTGLRRRRRTFNRTA 
ETVPKLSSQHSSGDVGLGAKGDITGKKKEMENYNVTLIVVMSIFVLLLVLLNVTLFLKLSKIE 
HAAQSFYRLRLQEEKSLNLASDMVSRAETIQKNKDQAHRLKGVLRDSIVMLEQLKSSLIMLQK 

TFDLLNKNKTGMAVES 

Transmembrane domain: 

amino acids 352-371 

N-glycosylation sites. 

amino acids 3-7, 54-58, 312-316, 349-353, 367-371, 449-453 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 81-85, 307-311 

Tyrosine kinase phosphorylation sites. 

amino acids 202-211, 246-254, 341-349 

N-myristoylation site. 

amino acids 259-265 

Amidation site. 

amino acids 339-343 
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FIGURE 237

CAGGGGCTGGAGGGCAGGGGAGGGGATGATGTCATTCCTGCTCGGCGCAATCCTGACCCTGCT 
CTGGGCGCCCACGGCTCAGGCTGAGGTTCTGCTGCAGCCTGACTTCAATGCTGAAAAGTTCTC 
AGGCCTCTGGTACGTGGTCTCCATGGCATCTGACTGCAGGGTCTTCCTGGGCAAGAAGGACCA 
CCTGTCCATGTCCACCAGGGCCATCAGGCCCACAGAGGAGGGCGGCCTCCACGTCCACATGGA 
GTTCCCGGGGGCGGACGGCTGTAACCAGGTGGATGCCGAGTACCTGAAGGTGGGCTCCGAGGG 
ACACTTCAGAGTCCCGGCCTTGGGCTACCTGGACGTGCGCATCGTGGACACAGACTACAGCTC 
CTTCGCCGTCCTTTACATCTACAAGGAGCTGGAGGGGGCCCTCAGCACCATGGTGCAGCTCTA 
CAGCCGGACCCAGGATGTGAGTCCCCAGGCTCTGAAGTCCTTCCAGGACTTCTACCCGACCCT 
GGGGCTCCCCAAGGACATGATGGTCATGCTGCCCCAGTCAGATGCATGCAACCCTGAGAGCAA 
GGAGGCGCCCTGACACCTCCGGAGCCCCACCCCCGCCCTTCCCAGGTGGAGCCAAAGCAGCAG 
GCGCCTTTGCCCCTGGAGTCAAGACCCACAGCCCTCGGGGACCACCTGGAGTCTCTCCATCCT 
CCACCCCCCGCCTGTGGGATGCCTTGTGGGACGTCTCTTTCTATTCAATAAACAGATGCTGCA 



GCCTCA 
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FIGURE 238 

MMSFLLGAILTLLWAPTAQAEVLLQPDFNAEKFSGLWYVVSMASDCRVFLGKKDHLSMSTRAI 
RPTEEGGLHVHMEFPGADGCNQVDAEYLKVGSEGHFRVPALGYLDVRIVDTDYSSFAVLYIYK 
ELEGALSTMVQLYSRTQDVSPQALKSFQDFYPTLGLPKDMMVMLPQSDACNPESKEAP 

Signal peptide: 

amino acids 1-20 

Tyrosine kinase phosphorylation site. 

amino acids 110-117 

N-myristoylation sites. 

amino acids 7-13, 79-85, 130-136 

Amidation site. 

amino acids 50-54 
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FIGURE 239 

GGCGCGCTGGTCCAGGTGAGCGGGCGCGTCCCCGCGACGGCGCTGCCTGCCCGAGGCGGTTCA 
CGTAAAGACAGCGAGATCCTGAGGGCCAGCCGGGAAGGAGGCGTGGATATGGAGCTGGCTGCT 
GCCAAGTCCGGGGCCCGCGCCGCTGCCTAGCGCGTCCTGGGGACTCTGTGGGGACGCGCCCCG 
CGCCGCGGCTCGGGGACCCGTAGAGCCCGGCGCTGCGCGCATGGCCCTGCTCTCGCGCCCCGC 
GCTCACCCTCCTGCTCCTCCTCATGGCCGCTGTTGTCAGGTGCCAGGAGCAGGCCCAGACCAC 
CGACTGGAGAGCCACCCTGAAGACCATCCGGAACGGCGTTCATAAGATAGACACGTACCTGAA 
CGCCGCCTTGGACCTCCTGGGAGGCGAGGACGGTCTCTGCCAGTATAAATGCAGTGACGGATC 
TAAGCCTTTCCCACGTTATGGTTATAAACCCTCCCCACCGAATGGATGTGGCTCTCCACTGTT 
TGGTGTTCATCTTAACATTGGTATCCCTTCCCTGACAAAGTGTTGCAACCAACACGACAGGTG 
CTATGAGACCTGTGGCAAAAGCAAGAATGACTGTGATGAAGAATTCCAGTATTGCCTCTCCAA 
GATCTGCCGAGATGTACAGAAAACACTAGGACTAACTCAGCATGTTCAGGCATGTGAAACAAC 
AGTGGAGCTCTTGTTTGACAGTGTTATACATTTAGGTTGTAAACCATATCTGGACAGCCAACG 
AGCCGCATGCAGGTGTCATTATGAAGAAAAAACTGATCTTTAAAGGAGATGCCGACAGCTAGT 
GACAGATGAAGATGGAAGAACATAACCTTTGACAAATAACTAATGTTTTTACAACATAAAACT 
GTCTTATTTTTGTGAAAGGATTATTTTGAGACCTTAAAATAATTTATATCTTGATGTTAAAAC 
CTCAAAGCAAAAAAAGTGAGGGAGATAGTGAGGGGAGGGCACGCTTGTCTTCTCAGGTATCTT 
CCCCAGCATTGCTCCCTTACTTAGTATGCCAAATGTCTTGACCAATATCAAAAACAAGTGCTT 
GTTTAGCGGAGAATTTTGAAAAGAGGAATATATAACTCAATTTTCACAACCACATTTACCAAA
AAAAGAGATCAAATATAAAATTCATCATAATGTCTGTTCAACATTATCTTATTTGGAAAATGG 
GGAAATTATCACTTACAAGTATTTGTTTACTATGAAATTTTAAATACACATTTATGCCTAGAA 
GGAACGGACTTTTTTTTTCTATTTTAATTACACATAATATGTAATTAAAGTACAACATAATAT 
GTTGTTTCTCTGTAGCCCGTTGAGCATATGAGTAAGTCACATTTCTATTAGGACTACTTACAA 
GGACAAGGTTTCCATTTTTCCAGTTGTAAAATTGGAACCATCAGCTGATAACCTCGTAGGGAG 
CAACCCCAGGATAGCTAAGTGTTATGTAATATGCCTAGAAGGTGATGTGAATGCGATTCAGAA 
GCATAGCCACTCCCATTTTATGAGCTACTCACATGACAAATGTCATCTTTTGCTATAACCTTT 
GCCAAGTTAGAGAAAAGATGGATTTAATGAGATAAATGAAAAGATATTTAACCTAAAAAAAAA 
AAAAAAAAAAAAAAAAAA 
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FIGURE 240 

MALLSRPALTLLLLLMAAVVRCQEQAQTTDWRATLKTIRNGVHKIDTYLNAALDLLGGEDGLC 
QYKCSDGSKPFPRYGYKPSPPNGCGSPLFGVHLNIGIPSLTKCCNQHDRCYETCGKSKNDCDE 
EFQYCLSKICRDVQKTLGLTQHVQACETTVELLFDSVIHLGCKPYLDSQRAACRCHYEEKTDL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites: 

amino acids 57-63,93-99 

Phospholipase A2 histidine active site: 

amino acids 106-114 

Neuraxin and MAP1B proteins repeat proteins Block: 

amino acids 109-137 
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FIGURE 241 

GATTCCGAGCGCCTCCACTGCTGGTCCGTTGGCCAGATCAACTCGCCGCGTGGGCCGGCCGTT 

CCCTGAGAGTCTGAGCGCTCGCCGCACCCCCTTCCGAGCTTCTATTGGCCGTAGCAGACGTCC 

GTCTGCCGCTATCTCCGCCCCAATACGGAAGCGGCCTAGTCCTCCGGCTCCGACAGCTGGGTG 

TCCAGGCCATGGGGCAGCCCTGGGCGGCTGGGAGCACGGACGGGGCGCCCGCGCAGCTGCCTC

TCGTGCTCACCGCGCTGTGGGCCGCGGCCGTGGGCCTGGAGCTGGCTTACGTGCTGGTGCTCG 

GTCCCGGGCCGCCGCCGCTGGGACCCCTGGCCCGGGCCTTGCAGCTGGCGCTGGCCGCCTTCC 

AGCTGCTCAACCTGCTGGGCAACGTGGGGCTCTTCCTGCGCTCGGATCCCAGCATCCGTGGCG 

TGATGCTGGCCGGCCGCGGTCTGGGCCAGGGCTGGGCTTACTGCTACCAATGCCAAAGCCAGG 

TGCCGCCACGCAGCGGACACTGCTCTGCCTGCCGCGTCTGCATCCTGCGTCGGGACCACCACT 

GCCGCCTGCTGGGCCGCTGCGTGGGCTTCGGCAACTACCGGCCCTTCCTGTGCCTGCTGCTTC 

ATGCCGCCGGCGTCCTGCTCCACGTCTCTGTGCTGCTGGGCCCTGCACTGTCGGCCCTGCTGC 

GAGCCCACACGCCCCTCCACATGGCTGCCCTCCTCCTGCTTCCCTGGCTCATGTTGCTCACAG 

GCAGAGTGTCTCTGGCACAGTTTGCCTTGGCCTTCGTGACGGACACGTGCGTGGCGGGTGCGC 

TGCTGTGCGGGGCTGGGCTGCTCTTCCATGGGATGCTGCTGCTGCGGGGCCAGACCACATGGG 

AGTGGGCTCGGGGCCAGCACTCCTATGACCTGGGTCCCTGCCACAACCTGCAGGCAGCCCTGG 

GGCCCCGCTGGGCCCTCGTCTGGCTCTGGCCCTTCCTGGCCTCCCCATTGCCTGGGGATGGGA 

TCACCTTCCAGACCACAGCAGATGTGGGACACACAGCCTCCTGACTCCAGGAAGAGCCAGAGC 

TGTGCAGGGAGGAAGGGGTGAGAGGGGGGCCCCCACACCTAGACTCAGTAAGGAAGTCGGGTT

GGACCTTAACATCTGCATTGGACAACTCCACCCCTTCCTTGGCCTTGCCCCTGCCCGCCTACA 

CTCCTACGTGTCCAGGGCTTGGGCCGTGACTTAGGCAGAGGAGTGCAGAGGAGGGTCTGGCAG 

GGGCTGCTCAGGCCGCCTAGCTGCCCCTTTGCCAGGTTAATAAAGCACTGACTTGTTAA 
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FIGURE 242 

MGQPWAAGSTDGAPAQLPLVLTALWAAAVGLELAYVLVLGPGPPPLGPLARALQLALAAFQLL
NLLGNVGLFLRSDPSIRGVMLAGRGLGQGWAYCYQCQSQVPPRSGHCSACRVCILRRDHHCRL 
LGRCVGFGNYRPFLCLLLHAAGVLLHVSVLLGPALSALLRAHTPLHMAALLLLPWLMLLTGRV 
SLAQFALAFVTDTCVAGALLCGAGLLFHGMLLLRGQTTWEWARGQHSYDLGPCHNLQAALGPR 

WALVWLWPFLASPLPGDGITFQTTADVGHTAS

Important features: 
Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 51-66,143-160,174-191,198-214 
N-myristoylation sites:

amino acids 2-8,8-14,30-36,81-87,88-94,90-96,206-212 
Leucine zipper pattern: 

amino acids 143-165,150-172,157-179,164-186
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FIGURE 243

CTTGTCTTTGTGTCGGTTGTGATTTTCCTAATCTCTGATTTTCCTTTTCTCTCGGACGCTCTC 

CCTCTTCGGACCCATTTTCTCCCGTGCTTCATGCCCTGATAGCCTGGCCCCTTCCCGGCTTCC 

TTCGCTACCGGGGACGCCTCTAGTTTTTCTGAATTTCTGGCTGGCTCCACCCTCCGCGTTCAT 

CTTCCTCAAGAGTTCGCCCCTCTGGGGGCTCCTCTGTGTAATCGTCGCCTTCTCTGGGTATTT 

CTGTGAACTCCGTCTCACACCATCCCGCCATCTTCTCTGCCTTGGCCCCTTTTCTCTGTACAG 

CCAGCTCTGTGTCCTTTTCTTCTCCCCCTCTAAAATCGACTCCTCTTCTCCCTGAGAGCCCCA 

CCTTTGTGCCCCACTCCTCATTTTCCTACGCCTCCCTCTCTCTGCTGGTCCTCTCTCTCCCTG 

CAAGGTTCCATTCCATCAATTTGTTTGTCTTTTGTAGGGGTGGCATCCCCTCTGACTACTGCT 

CCATCCTTTTTTTTTTTTTTTTTTTTTTTTTTGCTTGAGGATTTCACTTCAATCTTTTCTGGT 

TGCGTCTCCACTTGTACTCAGCTTGTTAGGTCCAGGTCCAGTTGTTCTGCATCTGAGGCTGGC 

GTGTGCTGTCTTCTCTGATTGGCCTAATCTCCCTCACCCCCGTGAGATCTGTTGTCAGCCTTC 

GTTTCTCTTTCCTGTGTCCCAGCTTTTCTGCGGGTCTTGGCACCTTTCTTGGCCACAGATTTC 

TGGGTTACAGAGCATGTGTGTCTGAGGCATTGCAGGCAGAAAAGGGTGGCCGACGTGACCTCT 

AGCTGGACTGCTGGGCAGGGGAGCTGTCCTAGATAAAATTGGAAAGAAACAGTGACCCAGAGA 

CAGGTGGACAAAGAATTCGGGGACTGATGGGAACTGAGCTTGGGATCCAGACTGAAACTGATT 

CCAGACTGACCTCTAGCACCCAGGACCCAGACACAGGGCCATGGGACCCCAGCATTTGAGACT

TGTGCAGCTGTTCTGCCTTCTAGGGGCCATCCCCACTCTGCCTCGGGCTGGAGCTCTTTTGTG 

CTATGAAGCAACAGCCTCAAGATTCAGAGCTGTTGCTTTCCATAACTGGAAGTGGCTTCTGAT 

GAGGAACATGGTGTGTAAGCTGCAAGAGGGCTGCGAGGAGACGCTAGTGTTCATTGAGACAGG 

GACTGCAAGGGGAGTTGTGGGCTTTAAAGGCTGCAGCTCGTCTTCGTCTTACCCTGCGCAAAT 

CTCCTACCTTGTTTCCCCACCCGGAGTGTCCATTGCCTCCTACAGTCGCGTCTGCCGGTCTTA 

TCTCTGCAACAACCTCACCAATTTGGAGCCTTTTGTGAAACTCAAGGCCAGCACTCCTAAGTC 

TATCACATCTGCGTCCTGTAGCTGCCCGACCTGTGTGGGCGAGCACATGAAGGATTGCCTCCC 

AAATTTTGTCACCACTAATTCTTGCCCCTTGGCTGCTTCTACGTGTTACAGTTCCACCTTAAA 

ATTTCAGGCAGGGTTTCTCAATACCACCTTCCTCCTCATGGGGTGTGCTCGTGAACATAACCA 

GCTTTTAGCAGATTTTCATCATATTGGGAGCATCAAAGTGACTGAGGTCCTCAACATCTTAGA 

GAAGTCTCAGATTGTTGGTGCAGCATCCTCCAGGCAAGATCCTGCTTGGGGTGTCGTCTTAGG 

CCTCCTGTTTGCCTTCAGGGACTGACCATCTAGCTGCACCCGACAAGCACCCAGACTCTTTCA 

CATAACAAATAAAATAGCAGAGTTCCCTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

AAAAAAAAAA 
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FIGURE 244 

MGPQHLRLVQLFCLLGAIPTLPRAGALLCYEATASRFRAVAFHNWKWLLMRNMVCKLQEGCEE 
TLVFIETGTARGVVGFKGCSSSSSYPAQISYLVSPPGVSIASYSRVCRSYLCNNLTNLEPFVK
LKASTPKSITSASCSCPTCVGEHMKDCLPNFVTTNSCPLAASTCYSSTLKFQAGFLNTTFLLM 
GCAREHNQLLADFHHIGSIKVTEVLNILEKSQIVGAASSRQDPAWGVVLGLLFAFRD 

Important features:
Signal peptide: 

amino acids 1-20 

N-glycosylation sites:

amino acids 117-121,183-187 

N-myristoylation sites: 

amino acids 16-22,25-31,60-66,71-77,81-87,100-106,224-230, 
235-241,239-245 

Prokaryotic membrane lipoprotein lipid attachment site:

amino acids 181-192
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FIGURE 245 

GTGGAGTTGGGTGGTGTCGGGAGCCTCTCCCTGAGGGGCACCGCGTCTTCAGGAGCTGGGCCTCCAGTGCGGCGC 
GATGTCAGGCGCGGTGACAGCTCTGTGAGTCCGAGGCCGCGGCCGTGGCGCTGGGCGGCTGCGGGGCCTGACCGG 
TCCGCTCATGGTGCCGCCACGACGCCATCGCGGGGCAGGAAGGCCAGGGGTGCTGAGTTCTTCACCTCCTTTTAG
ACTGAGATCTGCCAAGTTTTCCGGCATTGCTCTTGAGGATCTCAGAAGGGCTCTTAAGACAAGACTGCAAATGGT 
GTGTGTATTTGTCATGAACCGAATGAATTCCCAGAACAGTGGTTTCACTCAGCGCAGGCGAATGGCTCTTGGGAT 
TGTTATTCTTCTGCTTGTTGATGTGATATGGGTTGCTTCCTCTGAACTTACTTCGTATGTTTTTACCCAGTACAA 
CAAACCATTCTTCAGCACCTTTGCAAAAACATCTATGTTTGTTTTGTACCTTTTGGGCTTTATTATTTGGAAGCC 
ATGGAGACAACAGTGTACAAGAGGACTTCGCGGAAAGCATGCTGCTTTTTTTGCAGATGCTGAAGGTTACTTTGC 
TGCTTGCACAACAGATACAACTATGAATAGTTCTTTGAGTGAACCTCTGTATGTGCCTGTGAAATTCCATGATCT 
TCCAAGTGAAAAACCTGAGAGCACAAACATTGATACTGAAAAAACCCCCAAAAAGTCTCGTGTGAGGTTCAGTAA 
TATCATGGAGATTCGACAGCTTCCGTCAAGTCATGCATTGGAAGCAAAGTTGTCTCGCATGTCATATCCTGTGAA
AGAACAAGAATCCATACTGAAAACTGTGGGGAAACTTACTGCAACTCAAGTAGCGAAAATTAGCTTTTTTTTTTG 
CTTTGTGTGGTTTTTGGCAAATTTGTCATATCAAGAAGCACTTTCAGACACACAAGTTGCTATAGTTAATATTTT 
ATCTTCAACTTCCGGACTTTTTACCTTAATCCTTGCTGCAGTATTTCCAAGTAACAGTGGAGATAGATTTACCCT 
TTCTAAACTATTAGCTGTAATTTTAAGCATTGGAGGCGTTGTACTGGTAAACCTGGCAGGGTCTGAAAAACCTGC 
TGGAAGAGACACAGTAGGTTCCATTTGGTCTCTTGCTGGAGCCATGCTCTATGCTGTCTATATTGTTATGATTAA 
GAGAAAAGTAGATAGAGAAGACAAGTTGGATATTCCAATGTTCTTTGGTTTTGTAGGTTTGTTTAATCTGCTGCT 
CTTATGGCCAGGTTTCTTTTTACTTCATTATACTGGATTTGAGGACTTCGAGTTTCCCAATAAAGTAGTATTAAT 
GTGCATTATCATTAATGGCCTTATTGGAACAGTACTCTCAGAGTTCCTGTGGTTGTGGGGCTGCTTTCTTACCTC 
ATCATTGATAGGCACACTTGCACTAAGCCTTACAATACCTCTGTCCATAATAGCTGACATGTGTATGCAAAAGGT 
GCAGTTTTCTTGGTTATTTTTTGCAGGAGCTATCCCTGTATTTTTTTCATTTTTTATTGTAACTCTCCTATGCCA 
TTATAATAATTGGGATCCTGTGATGGTGGGAATCAGAAGAATATTTGCTTTTATATGCAGAAAACATCGAATTCA 
GAGAGTTCCAGAAGACAGCGAACAGTGTGAGAGTCTCATTTCTATGCACAGTGTTTCTCAGGAGGATGGAGCTAG 
TTAGCTGTCTGTTGTCTGTAGCCCAGCTTGATAATGGAACTATACAGCGAAGAGACAATCTCTGGCAAGTTTTTG
TAGAAAAAATGTTTCAGTGCCTAGTCTGAAAAATAACAGTTTGAGTTCTTTGAAACTCTAAAATATATTTTTCTC 
ATACCTGTTTTCTTCATTTTCATAATGAAGCACTTTGCTATGTAGCTGTGTACATATCACTACAGTTATAGGAAG 
TTTCAGTCTACAGTCCATCCAAAGGACCAACCTGCCTTACACATCTCAAGGAATTCAGCTGTTGAAATCATTTGA 
ACTAATCAAGGAATAAATCCTAATGTTCTGGGACTTTATTTTCACATGTTAAATGCTGGAATATATTATGAAAAT 
GTTTTCAAGAAATCACTTAAGTGTTCATAGACCAGTATTTCTGACAGGTAAAATGCTAAAATAAGCTACCTGTAA 
TAAGTGTGGATTATATTTTTGGGTTTTGTAGAATATTGCAAATTAACCACACAAAAAATGTTTAATTTATGCAAC 
AAGCATGTTTGTGCAAATTTCATGGGACTTTAAAAAGAATAAGTATTTGAGAAAATATCTGGTTCACTTACACTA 
CATTTACTGTATTATTCTTTTATAGCATTAGGTGCCTTGTATTTTAAATCTGTGACAAACCATGGCAAATTTTTA
AAGGGGAAGTATTATTATAAAATGAAGAAATATGTATTTCTAAAGGCTATATTGCTGTAAACTTAATTGATAAAG 
CTCTGTTTAATTTAGAGTTTTGAAGAAATAGTCTCCCTTCAATTAAGAAATTTTCATAATGGAATGATTTAAATT 
GAAGTGACAAAGAGTATTATTAAAATACAATGTTTATAAAAAAA
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FIGURE 246 

MVPPRRHRGAGRPGVLSSSPPFRLRSAKFSGIALEDLRRALKTRLQMVCVFVMNRMNSQNSGF 
TQRRRMALGIVILLLVDVIWVASSELTSYVFTQYNKPFFSTFAKTSMFVLYLLGFIIWKPWRQ 
QCTRGLRGKHAAFFADAEGYFAACTTDTTMNSSLSEPLYVPVKFHDLPSEKPESTNIDTEKTP 
KKSRVRFSNIMEIRQLPSSHALEAKLSRMSYPVKEQESILKTVGKLTATQVAKISFFFCFVWF 
LANLSYQEALSDTQVAIVNILSSTSGLFTLILAAVFPSNSGDRFTLSKLLAVILSIGGVVLVN 
LAGSEKPAGRDTVGSIWSLAGAMLYAVYIVMIKRKVDREDKLDIPMFFGFVGLFNLLLLWPGF 
FLLHYTGFEDFEFPNKVVLMCIIINGLIGTVLSEFLWLWGCFLTSSLIGTLALSLTIPLSIIA 
DMCMQKVQFSWLFFAGAIPVFFSFFIVTLLCHYNNWDPVMVGIRRIFAFICRKHRIQRVPEDS 

EQCESLISMHSVSQEDGAS 

Important features:
Transmembrane domain: 

amino acids 69-87,105-118,237-256,266-285,300-316,332-346, 
364-379,399-419,453-472 

N-glycosylation sites: 

amino acids 157-161,255-259 

N-myristoylation sites: 

amino acids 14-20,329-335,404-410,407-413,418-424 
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FIGURE 247 

CGTCTGTAGAGATATCATGAACTTCAACTTAGCTTTGGTACTTTCTTCCCTGAAGACAGAGGG 
CAGAACTCTGAGTTCCAGAACCATTTTCAACTGTATTGGGGACCAATCACTTGACTCTATTCT 
TGTCTCTCTGACAGATGACGCTACACTCTCCTCTGAATAATGGACACCATTTCTAAAACTGAA 
TCCTGCTACTAAAATAATTCAGATGATATATTTTTCCAATTCTACAATCTTGCTTTGTTTTAT 
TTAGTTGTTTTCTCTCTCTCTTCCCAGTTTTCCAGAGACTGGAGCTAAACTGGGCTTTCAACA 
TCATCATGAAGTTTATCCTCCTCTGGGCCCTCTTGAATCTGACTGTTGCTTTGGCCTTTAATC
CAGATTACACAGTCAGCTCCACTCCCCCTTACTTGGTCTATTTGAAATCTGACTACTTGCCCT 
GCGCTGGAGTCCTGATCCACCCGCTTTGGGTGATCACAGCTGCACACTGCAATTTACCAAAGC 
TTCGGGTGATATTGGGGGTTACAATCCCAGCAGACTCTAATGAAAAGCATCTGCAAGTGATTG 
GCTATGAGAAGATGATTCATCATCCACACTTCTCAGTCACTTCTATTGATCATGACATCATGC 
TAATCAAGCTGAAAACAGAGGCTGAACTCAATGACTATGTGAAATTAGCCAACCTGCCCTACC 
AAACTATCTCTGAAAATACCATGTGCTCTGTCTCTACCTGGAGCTACAATGTGTGTGATATCT 
ACAAAGAGCCCGATTCACTGCAAACTGTGAACATCTCTGTAATCTCCAAGCCTCAGTGTCGCG 
ATGCCTATAAAACCTACAACATCACGGAAAATATGCTGTGTGTGGGCATTGTGCCAGGAAGGA 
GGCAGCCCTGCAAGGAAGTTTCTGCTGCCCCGGCAATCTGCAATGGGATGCTTCAAGGAATCC 
TGTCTTTTGCGGATGGATGTGTTTTGAGAGCCGATGTTGGCATCTATGCCAAAATTTTTTACT 
ATATACCCTGGATTGAAAATGTAATCCAAAATAACTGAGCTGTGGCAGTTGTGGACCATATGA 
CACAGCTTGTCCCCATGGTTCACCTTTAGAATTAAATATAAATTAACTCCTC 
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FIGURE 248 

MKFILLWALLNLTVALAFNPDYTVSSTPPYLVYLKSDYLPCAGVLIHPLWVITAAHCNLPKLR
VILGVTIPADSNEKHLQVIGYEKMIHHPHFSVTSIDHDIMLIKLKTEAELNDYVKLANLPYQT 
ISENTMCSVSTWSYNVCDIYKEPDSLQTVNISVISKPQCRDAYKTYNITENMLCVGIVPGRRQ 
PCKEVSAAPAICNGMLQGILSFADGCVLRADVGIYAKIFYYIPWIENVIQNN 

Important features: 
Signal peptide: 

amino acids 1-17 

N-glycosylation sites: 

amino acids 11-15,156-160,173-177 

Tyrosine kinase phosphorylation site: 

amino acids 108-117 

N-myristoylation sites: 
amino acids 182-188,203-209 

Amidation site: 

amino acids 185-189 

Serine proteases, trypsin family, histidine active site: 

amino acids 52-58 
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FIGURE 249 

GCGAGGCGGCCGCTGTCTTCTGCTGCGGCTTCCGCGACCACAAGTACTGCTGCGACGACCCGC 
ACAGCTTCTTCCCCTACGAGCACAGCTACATGTGGTGGCTCAGCATTGGCGCTCTCATAGGCC
TGTCCGTAGCAGCAGTGGTTCTTCTCGCCTTCATTGTTACCGCCTGTGTGCTCTGCTACCTGT 
TCATCAGCTCTAAGCCCCACACAAAGTTGGACCTGGGCTTGAGCTTACAGACAGCAGGCCCTG 
AGGAGGTTTCTCCTGACTGCCAAGGTGTGAACACAGGCATGGCGGCAGAAGTGCCAAAAGTGA 
GCCCTCTCCAGCAGAGTTACTCCTGCTTGAACCCGCAGCTGGAGAGCAATGAGGGGCAGGCTG 
TGAACTCCAAACGCCTCCTCCATCATTGCTTCATGGCCACAGTGACCACCAGTGACATTCCAG 
GCAGCCCTGAGGAAGCCTCTGTACCCAACCCTGACCTATGTGGACCAGTCCCATAAACATTCA 
ATAAATGTCTCCATACCATCAA
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FIGURE 250 

MWWLSIGALIGLSVAAVVLLAFIVTACVLCYLFISSKPHTKLDLGLSLQTAGPEEVSPDCQGV
NTGMAAEVPKVSPLQQSYSCLNPQLESNEGQAVNSKRLLHHCFMATVTTSDIPGSPEEASVPN 

PDLCGPVP 

Important features: 
Signal peptide: 

Amino acids 1-26 

N-myristoylation sites:

Amino acids 7-13,11-17,62-68,93-99 
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FIGURE 251 

GTGGTTTGGATTGAGCCGGGCCCGGCCGGGGCGCCGAGTCGGAGGGGGTGGCAGTGAGCGGCG 
GCAGAGGCTACGGGGCTCGGTTTGGCTGACTGGGGAGTCGGCAGGCGGCAGGAACCATGCGAG 
GCCAGCGGAGCCTGCTGCTGGGCCCGGCCCGCCTCTGCCTCCGCCTCCTTCTGCTGCTGGGTT 
ACAGGCGCCGCTGTCCACCTCTACTCCGGGGTCTAGTACAGCGCTGGCGCTACGGCAAGGTCT 
GCCTGCGCTCCCTGCTCTACAACTCCTTTGGGGGCAGTGACACCGCTGTTGATGCTGCCTTTG 
AGCCTGTCTACTGGCTGGTAGACAACGTGATCCGCTGGTTTGGAGTGGTGTTCGTGGTCCTGG 
TGATCGTGCTGACAGGCTCCATTGTAGCTATCGCCTACCTGTGTGTCCTGCCTCTCATCCTCC 
GAACCTACTCAGTGCCACGACTCTGCTGGCATTTCTTCTATAGCCACTGGAATCTGATCCTGA 
TTGTCTTCCACTACTACCAGGCCATCACCACTCCGCCTGGGTACCCACCCCAGGGCAGGAATG 
ATATCGCCACCGTCTCCATCTGTAAGAAGTGCATTTACCCCAAGCCAGCCCGAACACACCACT 
GCAGCATCTGCAACAGGTGTGTGCTGAAGATGGATCACCACTGCCCCTGGCTAAACAATTGTG 
TGGGCCACTATAACCATCGGTACTTCTTCTCTTTCTGCTTTTTCATGACTCTGGGCTGTGTCT 
ACTGCAGCTATGGAAGTTGGGACCTTTTCCGGGAGGCTTATGCTGCCATTGAGACTTATCACC 
AGACCCCACCACCCACCTTCTCCTTTCGAGAAAGGATGACTCACAAGAGTCTTGTCTACCTCT 
GGTTCCTGTGCAGTTCTGTGGCACTTGCCCTGGGTGCCCTAACTGTATGGCATGCTGTTCTCA 
TCAGTCGAGGTGAGACTAGCATCGAAAGGCACATCAACAAGAAGGAGAGACGTCGGCTACAGG 
CCAAGGGCAGAGTATTTAGGAATCCTTACAACTACGGCTGCTTGGACAACTGGAAGGTATTCC 
TGGGTGTGGATACAGGAAGGCACTGGCTTACTCGGGTGCTCTTACCTTCTAGTCACTTGCCCC
ATGGGAATGGAATGAGCTGGGAGCCCCCTCCCTGGGTGACTGCTCACTCAGCCTCTGTGATGG 
CAGTGTGAGCTGGACTGTGTCAGCCACGACTCGAGCACTCATTCTGCTCCCTATGTTATTTCA 
AGGGCCTCCAAGGGCAGCTTTTCTCAGAATCCTTGATCAAAAAGAGCCAGTGGGCCTGCCTTA 
GGGTACCATGCAGGACAATTCAAGGACCAGCCTTTTTACCACTGCAGAAGAAAGACACAATGT 
GGAGAAATCTTAGGACTGACATCCCTTTACTCAGGCAAACAGAAGTTCCAACCCCAGACTAGG 
GGTCAGGCAGCTAGCTACCTACCTTGCCCAGTGCTGACCCGGACCTCCTCCAGGATACAGCAC 
TGGAGTTGGCCACCACCTCTTCTACTTGCTGTCTGAAAAAACACCTGACTAGTACAGCTGAGA 
TCTTGGCTTCTCAACAGGGCAAAGATACCAGGCCTGCTGCTGAGGTCACTGCCACTTCTCACA 
TGCTGCTTAAGGGAGCACTVAATAAAGGTATTCGATTTTTAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA
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FIGURE 252 

MRGQRSLLLGPARLCLRLLLLLGYRRRCPPLLRGLVQRWRYGKVCLRSLLYNSFGGSDTAVDA 
AFEPVYWLVDNVIRWFGVVFVVLVIVLTGSIVAIAYLCVLPLILRTYSVPRLCWHFFYSHWNL
ILIVFHYYQAITTPPGYPPQGRNDIATVSICKKCIYPKPARTHHCSICNRCVLKMDHHCPWLN 
NCVGHYNHRYFFSFCFFMTLGCVYCSYGSWDLFREAYAAIETYHQTPPPTFSFRERMTHKSLV 
YLWFLCSSVALALGALTVWHAVLISRGETSIERHINKKERRRLQAKGRVFRNPYNYGCLDNWK 
VFLGVDTGRHWLTRVLLPSSHLPHGNGMSWEPPPWVTAHSASVMAV 

Important features: 
Transmembrane domain: 

amino acids 88-100,202-216,254-274 

N-myristoylation sites: 

amino acids 55-61,56-62,92-98,210-216,309-315,319-325,340-346 

Prokaryotic membrane lipoprotein lipid attachment site: 

amino acids 201-212 
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FIGURE 253 

GATCAAGCGCCTTCCTTTCCCTTCCTCTCCCTACTTGGCCTTTGCCCTAAGCCAAGACCTGGCCATCAGCCTGGC 
TGCAGGGGCCTGCAGAGCCAGCTGCACTTTTTCAGGTATGGGGGAGGGCCAGGCACCATGAAGCCAGTGTGGGTC 
GCCACCCTTCTGTGGATGCTACTGCTGGTGCCCAGGCTGGGGGCCGCCCGGAAGGGGTCCCCAGAAGAGGCCTCC 
TTCTACTATGGAACCTTCCCTCTTGGCTTCTCCTGGGGCGTGGGCAGTTCTGCCTACCAGACGGAGGGCGCCTGG 
GACCAGGACGGGAAAGGGCCTAGCATCTGGGACGTCTTCACACACAGTGGGAAGGGGAAAGTGCTTGGGAATGAG 
ACGGCAGATGTAGCCTGTGACGGCTACTACAAGGTCCAGGAGGACATCATTCTGCTGAGGGAACTGCACGTCAAC 
CACTACCGATTCTCCCTGTCTTGGCCCCGGCTCCTGCCCACAGGCATCCGAGCCGAGCAGGTGAACAAGAAGGGA 
ATCGAATTCTACAGTGATCTTATCGATGCCCTTCTGAGCAGCAACATCACTCCCATCGTGACCTTGCACCACTGG 
GATCTGCCACAGCTGCTCCAGGTCAAATACGGTGGGTGGCAGAATGTGAGCATGGCCAACTACTTCAGAGACTAC 
GCCAACCTGTGCTTTGAGGCCTTTGGGGACCGTGTGAAGCACTGGATCACGTTCAGTGATCCTCGGGCAATGGCA 
GAAAAAGGCTATGAGACGGGCCACCATGCGCCGGGCCTGAAGCTCCGCGGCACCGGCCTGTACAAGGCAGCACAC 
CACATCATTAAGGCCCACGCCAAAACCTGGCATTCTTATAACACCACGTGGCGCAGCAAGCAGCAAGGTCTGGTG 
GGAATTTCACTGAACTGTGACTGGGGGGAACCTGTGGACATTAGTAACCCCAAGGACCTAGAGGCTGCCGAGAGA 
TACCTACAGTTCTGTCTGGGCTGGTTTGCCAACCCCATTTATGCCGGTGACTACCCCCAAGTCATGAAGGACTAC 
ATTGGAAGAAAGAGTGCAGAGCAAGGCCTGGAGATGTCGAGGTTACCGGTGTTCTCACTCCAGGAGAAGAGCTAC 
ATTAAAGGCACATCCGATTTCTTGGGATTAGGTCATTTTACTACTCGGTACATCACGGAAAGGAACTACCCCTCC 
CGCCAGGGGCCCAGCTACCAGAACGATCGTGACTTGATAGAGCTGGTTGACCCAAACTGGCCAGATCTGGGGTCT 
AAATGGCTATATTCTGTGCCATGGGGATTTAGGAGGCTCCTTAACTTTGCTCAGACTCAATACGGTGATCCTCCC 
ATATATGTGATGGAAAATGGAGCATCTCAAAAATTCCACTGTACTCAATTATGTGATGAGTGGAGAATTCAATAC 
CTTAAAGGATACATAAATGAAATGCTAAAAGCTATAAAAGATGGTGCTAATATAAAGGGGTATACTTCCTGGTCT 
CTGTTGGATAAGTTTGAATGGGAGAAAGGATACTCAGATAGATATGGATTCTACTATGTTGAATTTAACGACAGA 
AATAAGCCTCGCTATCCAAAGGCTTCAGTTCAATATTACAAGAAGATTATCATTGCCAATGGGTTTCCCAATCCA 
AGAGAGGTGGAAAGTTGGTACCTCAAAGCTTTGGAAACTTGCTCTATCAACAATCAGATGCTTGCTGCAGAGCCT 
TTGCTAAGTCACATGCAAATGGTTACGGAGATCGTGGTACCCACTGTCTGCTCCCTCTGTGTCCTCATCACTGCT 
GTTCTACTAATGCTCCTCCTGAGGAGGCAGAGCTGAGACAGGATTATCAATTTTGGAGCTTCATAAGAGAATCTT 
CAGGATCTTCCTCCCTTTTCTGCTTTGAGGGTTTCCATACATTGCTGTTTTCAGGTTCTACAATAATTACCTTTT 
TTTCTCTTTCTCTTTTTGGCTTGTGCTGGGATTTAAGAATTAGAAAATAAAAATAAGCAGAAATTA 
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FIGURE 254 

MKPVWVATLLWMLLLVPRLGAARKGSPEEASFYYGTFPLGFSWGVGSSAYQTEGAWDQDGKGPSIWDVFTHSGKG 
KVLGNETADVACDGYYKVQEDIILLRELHVNHYRFSLSWPRLLPTGIRAEQVNKKGIEFYSDLIDALLSSNITPI 
VTLHHWDLPQLLQVKYGGWQNVSMANYFRDYANLCFEAFGDRVKHWITFSDPRAMAEKGYETGHHAPGLKLRGTG 
LYKAAHHIIKAHAKTWHSYNTTWRSKQQGLVGISLNCDWGEPVDISNPKDLEAAERYLQFCLGWFANPIYAGDYP 
QVMKDYIGRKSAEQGLEMSRLPVFSLQEKSYIKGTSDFLGLGHFTTRYITERNYPSRQGPSYQNDRDLIELVDPN 
WPDLGSKWLYSVPWGFRRLLNFAQTQYGDPPIYVMENGASQKFHCTQLCDEWRIQYLKGYINEMLKAIKDGANIK 
GYTSWSLLDKFEWEKGYSDRYGFYYVEFNDRNKPRYPKASVQYYKKIIIANGFPNPREVESWYLKALETCSINNQ
MLAAEPLLSHMQMVTEIVVPTVCSLCVLITAVLLMLLLRRQS

Important features: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 541-558 

N-glycosylation sites: 

amino acids 80-84,171-175,245-249 

Glycosaminoglycan attachment site: 

amino acids 72-76 

cAMP- and cGMP-dependent protein kinase phosphorylation sites: 

amino acids 23-27,564-568 


Tyrosine kinase phosphorylation sites: 

amino acids 203-211,347-355,460-468,507-514 

N-myristoylation sites:

amino acids 44-50,79-85,167-173,225-231,257-263,315-321 

Amidation site: 

amino acids 307-311 

Glycosyl hydrolases family 1 active site: 

amino acids 407-416 

Glycosyl hydrolases family 1 N-terminal signature:

amino acids 41-56 

Motif name Glycosyl hydrolases family: 

amino acids 37-67
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FIGURE 255

CGCGAAGATGCGAAAGGTGGTTTTGATCACCGGGGCTAGCAGTGGCATTGGCCTGGCCCTCTG

CAAGCGGCTGCTGGCGGAAGATGATGAGCTTCATCTGTGTTTGGCGTGCAGGAACATGAGCAA 

GGCAGAAGCTGTCTGTGCTGCTCTGCTGGCCTCTCACCCCACTGCTGAGGTCACCATTGTCCA 

GGTGGATGTCAGCAACCTGCAGTCGGTCTTCCGGGCCTCCAAGGAACTTAAGCAAAGGTTTCA 

GAGATTAGACTGTATATATCTAAATGCTGGGATCATGCCTAATCCACAACTAAATATCAAAGC 

ACTTTTCTTTGGCCTCTTTTCAAGAAAAGTGATTCATATGTTCTCCACAGCTGAAGGCCTGCT 

GACCCAGGGTGATAAGATCACTGCTGATGGACTTCAGGAGGTGTTTGAGACCAATGTCTTTGG 

CCATTTTATCCTGATTCGGGAACTGGAGCCTCTCCTCTGTCACAGTGACAATCCATCTCAGCT 

CATCTGGACATCATCTCGCAGTGCAAGGAAATCTAATTTCAGCCTCGAGGACTTCCAGCACAG 

CAAAGGCAAGGAACCCTACAGCTCTTCCAAATATGCCACTGACCTTTTGAGTGTGGCTTTGAA 

CAGGAACTTCAACCAGCAGGGTCTCTATTCCAATGTGGCCTGTCCAGGTACAGCATTGACCAA 

TTTGACATATGGAATTCTGCCTCCGTTTATATGGACGCTGTTGATGCCGGCAATATTGCTACT 

TCGCTTTTTTGCAAATGCATTCACTTTGACACCATATAATGGAACAGAAGCTCTGGTATGGCT 

TTTCCACCAAAAGCCTGAATCTCTCAATCCTCTGATCAAATATCTGAGTGCCACCACTGGCTT 

TGGAAGAAATTATATTATGACCCAGAAGATGGACCTAGATGAAGACACTGCTGAAAAATTTTA 

TCAAAAGTTACTGGAACTGGAAAAGCACATTAGGGTCACTATTCAAAAAACAGATAATCAGGC 

CAGGCTCAGTGGCTCATGCCTATAATTCCAGCACTTTGGGAGGCCAAGGCAGAAGGATCACTT 

GAGACCAGGAGTTCAAGACCAGCCTGAGAAACATAGTGAGCCCTTGTCTCTACAAAAAGAAAT

AAAAATAATAGCTGGGTGTGGTGGCATGCGCATGTAGTCCCAGCTACTCAGAAGGATGAGGTG 

GGAGGATCTCTTGAGGCTGGGAGGCAGAGGTTGCAGTGAGCTGAGATTGTGCCACTGCACTCC 

AGCCTGGGTGACAGCGAGACCCTGTCTCAAAATATGTATATATTTAATATATATATAAAACCA 

GAGCTGACAATGACACTCTGGAACATTGCATACCTTCTGTACATTCTGGGGTACATGGATTTC 

TACTGAGTTGGATAATATGCATTTGTAATAAACTATGAACTATGAA 
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FIGURE 256 

MRKVVLITGASSGIGLALCKRLLAEDDELHLCLACRNMSKAEAVCAALLASHPTAEVTIVQVD
VSNLQSVFRASKELKQRFQRLDCIYLNAGIMPNPQLNIKALFFGLFSRKVIHMFSTAEGLLTQ
GDKITADGLQEVFETNVFGHFILIRELEPLLCHSDNPSQLIWTSSRSARKSNFSLEDFQHSKG
KEPYSSSKYATDLLSVALNRNFNQQGLYSNVACPGTALTNLTYGILPPFIWTLLMPAILLLRF
FANAFTLTPYNGTEALVWLFHQKPESLNPLIKYLSATTGFGRNYIMTQKMDLDEDTAEKFYQK
LLELEKHIRVTIQKTDNQARLSGSCL

Important features:
Transmembrane domain: 

amino acids 234-254 

N-glycosylation sites: 

amino acids 37-41,178-182,229-233,263-267 

Glycosaminoglycan attachment site: 

amino acids 12-16 



N-myristoylation sites: 

amino acids 9-15,13-19,15-21,215-221,224-230 
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FIGURE 257

CGGACGCGTGGGGCCGTATGCGCGGCTCTGTGGAGTGCACCTGGGGTTGGGGGCACTGTGCCC 

CCAGCCCCCTGCTCCTTTGGACTCTACTTCTGTTTGCAGCCCCATTTGGCCTGCTGGGGGAGA 

AGACCCGCCAGGTGTCTCTGGAGGTCATCCCTAACTGGCTGGGCCCCCTGCAGAACCTGCTTC 

ATATACGGGCAGTGGGCACCAATTCCACACTGCACTATGTGTGGAGCAGCCTGGGGCCTCTGG 

CAGTGGTAATGGTGGCCACCAACACCCCCCACAGCACCCTGAGCATCAACTGGAGCCTCCTGC 

TATCCCCTGAGCCCGATGGGGGCCTGATGGTGCTCCCTAAGGACAGCATTCAGTTTTCTTCTG 

CCCTTGTTTTTACCAGGCTGCTTGAGTTTGACAGCACCAACGTGTCCGATACGGCAGCAAAGC 

CTTTGGGAAGACCATATCCTCCATACTCCTTGGCCGATTTCTCTTGGAACAACATCACTGATT 

CATTGGATCCTGCCACCCTGAGTGCCACATTTCAAGGCCACCCCATGAACGACCCTACCAGGA 

CTTTTGCCAATGGCAGCCTGGCCTTCAGGGTCCAGGCCTTTTCCAGGTCCAGCCGACCAGCCC 

AACCCCCTCGCCTCCTGCACACAGCAGACACCTGTCAGCTAGAGGTGGCCCTGATTGGAGCCT 

CTCCCCGGGGAAACCGTTCCCTGTTTGGGCTGGAGGTAGCCACATTGGGCCAGGGCCCTGACT 

GCCCCTCAATGCAGGAGCAGCACTCCATCGACGATGAATATGCACCGGCCGTCTTCCAGTTGG 

ACCAGCTACTGTGGGGCTCCCTCCCATCAGGCTTTGCACAGTGGCGACCAGTGGCTTACTCCC 

AGAAGCCGGGGGGCCGAGAATCAGCCCTGCCCTGCCAAGCTTCCCCTCTTCATCCTGCCTTAG 

CATACTCTCTTCCCCAGTCACCCATTGTCCGAGCCTTCTTTGGGTCCCAGAATAACTTCTGTG 

CCTTCAATCTGACGTTCGGGGCTTCCACAGGCCCTGGCTATTGGGACCAACACTACCTCAGCT 

GGTCGATGCTCCTGGGTGTGGGCTTCCCTCCAGTGGACGGCTTGTCCCCACTAGTCCTGGGCA 

TCATGGCAGTGGCCCTGGGTGCCCCAGGGCTCATGCTGCTAGGGGGCGGCTTGGTTCTGCTGC 

TGCACCACAAGAAGTACTCAGAGTACCAGTCCATAAATTAAGGCCCGCTCTCTGGAGGGAAGG 

ACATTACTGAACCTGTCTTGCTGTGCCTCGAAACTCTGGAGGTTGGAGCATCAAGTTCCAGCC 

GGCCCCTTCACTCCCCCATCTTGCTTTTCTGTGGAACCTCAGAGGCCAGCCTCGACTTCCTGG 

AGACCCCCAGGTGGGGCTTCCTTCATACTTTGTTGGGGGACTTTGGAGGCGGGCAGGGGACAG 

GGCTATTGATAAGGTCCCCTTGGTGTTGCCTTCTTGCATCTCCACACATTTCCCTTGGATGGG 

ACTTGCAGGCCTAAATGAGAGGCATTCTGACTGGTTGGCTGCCCTGGAAGGCAAGAAAATAGA 

TTTATTTTTTTTCACAGGGAAAAAAAAAAAA 



WO 01/40466 



PCT/US00/32678 



258/550 

FIGURE 258 

MRGSVECTWGWGHCAPSPLLLWTLLLFAAPFGLLGEKTRQVSLEVIPNWLGPLQNLLHIRAVG 
TNSTLHYVWSSLGPLAVVMVATNTPHSTLSINWSLLLSPEPDGGLMVLPKDSIQFSSALVFTR 
LLEFDSTNVSDTAAKPLGRPYPPYSLADFSWNNITDSLDPATLSATFQGHPMNDPTRTFANGS 
LAFRVQAFSRSSRPAQPPRLLHTADTCQLEVALIGASPRGNRSLFGLEVATLGQGPDCPSMQE 
QHSIDDEYAPAVFQLDQLLWGSLPSGFAQWRPVAYSQKPGGRESALPCQASPLHPALAYSLPQ 
SPIVRAFFGSQNNFCAFNLTFGASTGPGYWDQHYLSWSMLLGVGFPPVDGLSPLVLGIMAVAL 

GAPGLMLLGGGLVLLLHHKKYSEYQSIN 

Important features: 
Signal peptide: 

amino acids 1-35 

Transmembrane domain:

amino acids 365-386 

N-glycosylation sites: 

amino acids 65-69,95-99,134-138,159-163,187-191,230-234,333-337 


cAMP- and cGMP-dependent protein kinase phosphorylation site: 

amino acids 397-401 

N-myristoylation sites: 

amino acids 3-9,63-69,235-241,273-279,292-298,324-330 

Leucine zipper pattern: 

amino acids 371-393 
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FIGURE 259 

CAGGCGGGCCCCCGCGCGGCAGGGCCCTGGACCCGCGCGGCTCCCGGGGATGGTGAGCAAGGCGCTGCTGCGCCT
CGTGTCTGCCGTCAACCGCAGGAGGATGAAGCTGCTGCTGGGCATCGCCTTGCTGGCCTACGTCGCCTCTGTTTG 
GGGCAACTTCGTTAATATGAGGTCTATCCAGGAAAATGGTGAACTAAAAATTGAAAGCAAGATTGAAGAGATGGT 
TGAACCACTAAGAGAGAAAATCAGAGATTTAGAAAAAAGCTTTACCCAGAAATACCCACCAGTAAAGTTTTTATC 
AGAAAAGGATCGGAAAAGAATTTTGATAACAGGAGGCGCAGGGTTCGTGGGCTCCCATCTAACTGACAAACTCAT 
GATGGACGGCCACGAGGTGACCGTGGTGGACAATTTCTTCACGGGCAGGAAGAGAAACGTGGAGCACTGGATCGG 
ACATGAGAACTTCGAGTTGATTAACCACGACGTGGTGGAGCCCCTCTACATCGAGGTTGACCAGATATACCATCT 
GGCATCTCCAGCCTCCCCTCCAAACTACATGTATAATCCTATCAAGACATTAAAGACCAATACGATTGGGACATT 
AAACATGTTGGGGCTGGCAAAACGAGTCGGTGCCCGTCTGCTCCTGGCCTCCACATCGGAGGTGTATGGAGATCC 
TGAAGTCCACCCTCAAAGTGAGGATTACTGGGGCCACGTGAATCCAATAGGACCTCGGGCCTGCTACGATGAAGG 
CAAACGTGTTGCAGAGACCATGTGCTATGCCTACATGAAGCAGGAAGGCGTGGAAGTGCGAGTGGCCAGAATCTT 
CAACACCTTTGGGCCACGCATGCACATGAACGATGGGCGAGTAGTCAGCAACTTCATCCTGCAGGCGCTCCAGGG 
GGAGCCACTCACGGTATACGGATCCGGGTCTCAGACAAGGGCGTTCCAGTACGTCAGCGATCTAGTGAATGGCCT 
CGTGGCTCTCATGAACAGCAACGTCAGCAGCCCGGTCAACCTGGGGAACCCAGAAGAACACACAATCCTAGAATT 
TGCTCAGTTAATTAAAAACCTTGTTGGTAGCGGAAGTGAAATTCAGTTTCTCTCCGAAGCCCAGGATGACCCACA 
GAAAAGAAAACCAGACATCAAAAAAGCAAAGCTGATGCTGGGGTGGGAGCCCGTGGTCCCGCTGGAGGAAGGTTT 
AAACAAAGCAATTCACTACTTCCGTAAAGAACTCGAGTACCAGGCAAATAATCAGTACATCCCCAAACCAAAGCC 
TGCCAGAATAAAGAAAGGACGGACTCGCCACAGCTGAACTCCTCACTTTTAGGACACAAGACTACCATTGTACAC 
TTGATGGGATGTATTTTTGGCTTTTTTTTGTTGTCGTTTAAAGAAAGACTTTAACAGGTGTCATGAAGAACAAAC 
TGGAATTTCATTCTGAAGCTTGCTTTAATGAAATGGATGTGCCTAAAAGCTCCCCTCAAAAAACTGCAGATTTTG 
CCTTGCACTTTTTGAATCTCTCTTTTTATGTAAAATAGCGTAGATGCATCTCTGCGTATTTTCAAGTTTTTTTAT
CTTGCTGTGAGAGCATATGTTGTGACTGTCGTTGACAGTTTTATTTACTGGTTTCTTTGTGAAGCTGAAAAGGAA 
CATTAAGCGGGACAAAAAATGCCGATTTTATTTATAAAAGTGGGTACTTAATAAATGAGTCGTTATACTATGCAT 
AAAGAAAAATCCTAGCAGTATTGTCAGGTGGTGGTGCGCCGGCATTGATTTTAGGGCAGATAAAAGAATTCTGTG 
TGAGAGCTTTATGTTTCTCTTTTAATTCAGAGTTTTTCCAAGGTCTACTTTTGAGTTGCAAACTTGACTTTGAAA 
TATTCCTGTTGGTCATGATCAAGGATATTTGAAATCACTACTGTGTTTTGCTGCGTATCTGGGGCGGGGGCAGGT 
TGGGGGGCACAAAGTTAACATATTCTTGGTTAACCATGGTTAAATATGCTATTTTAATAAAATATTGAAACTCA 
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FIGURE 260 

MVSKALLRLVSAVNRRRMKLLLGIALLAYVASVWGNFVNMRSIQENGELKIESKIEEMVEPLR 
EKIRDLEKSFTQKYPPVKFLSEKDRKRILITGGAGFVGSHLTDKLMMDGHEVTVVDNFFTGRK 
RNVEHWIGHENFELINHDVVEPLYIEVDQIYHLASPASPPNYMYNPIKTLKTNTIGTLNMLGL
AKRVGARLLLASTSEVYGDPEVHPQSEDYWGHVNPIGPRACYDEGKRVAETMCYAYMKQEGVE
VRVARIFNTFGPRMHMNDGRVVSNFILQALQGEPLTVYGSGSQTRAFQYVSDLVNGLVALMNS
NVSSPVNLGNPEEHTILEFAQLIKNLVGSGSEIQFLSEAQDDPQKRKPDIKKAKLMLGWEPVV
PLEEGLNKAIHYFRKELEYQANNQYIPKPKPARIKKGRTRHS 

Important features: 
Signal peptide: 

amino acids 1-32

N-glycosylation site: 

amino acids 316-320 

Tyrosine kinase phosphorylation site: 

amino acids 235-244 

N-myristoylation sites: 

amino acids 35-41,101-107,383-389 

Amidation sites: 

amino acids 123-127,233-237 
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FIGURE 261 

GCGTGGTGCGGGGGCGTGGGGAAATCGGGTTGCCCCAGCCGTTACTGGTCCGCGCAGTCAGGG 

CATCCTCCGCATCCTCCACATCCTTCCATGGCTCTGAAGAATAAATTCAGTTGTTTATGGATC

TTGGGTCTGTGTTTGGTAGCCACTACATCTTCCAAAATCCCATCCATCACTGACCCACACTTT 

ATAGACAACTGCATAGAAGCCCACAACGAATGGCGTGGCAAAGTCAACCCTCCCGCGGCCGAC 

ATGAAATACATGATTTGGGATAAAGGTTTAGCAAAGATGGCTAAAGCATGGGCAAACCAGTGC 

AAATTTGAACATAATGACTGTTTGGATAAATCATATAAATGCTATGCAGCTTTTGAATATGTT 

GGAGAAAATATCTGGTTAGGTGGAATAAAGTCATTCACACCAAGACATGCCATTACGGCTTGG 

TATAATGAAACCCAATTTTATGATTTTGATAGTCTATCATGCTCCAGAGTCTGTGGCCATTAT 

ACACAGTTAGTTTGGGCCAATTCATTTTATGTCGGTTGTGCAGTTGCAATGTGTCCTAACCTT 

GGGGGAGCTTCAACTGCAATATTTGTATGCAACTACGGACCTGCAGGAAATTTTGCAAATATG 

CCTCCTTACGCAAGAGGAGAATCTTGCTCTCTCTGCTCAAAAGAAGAGAAATGTGTAAAGAAC 

CTCTGCAGGACTCCACAACTTATTATACCTAACCAAAATCCATTTCTGAAGCCAACGGGGAGA 

GCACCTCAGCAGACAGCCTTTAATCCATTCAGCTTAGGTTTTCTTCTTCTGAGAATCTTTTAA 

TGTCATTTATATACAAAAGAAATTCTCAAATGTTAAAATAAAGGAATAGTTTATTGCTTAATA 
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FIGURE 262 

MALKNKFSCLWILGLCLVATTSSKIPSITDPHFIDNCIEAHNEWRGKVNPPAADMKYMIWDKG 
LAKMAKAWANQCKFEHNDCLDKSYKCYAAFEYVGENIWLGGIKSFTPRHAITAWYNETQFYDF 
DSLSCSRVCGHYTQLVWANSFYVGCAVAMCPNLGGASTAIFVCNYGPAGNFANMPPYARGESC 
SLCSKEEKCVKNLCRTPQLIIPNQNPFLKPTGRAPQQTAFNPFSLGFLLLRIF 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site: 

amino acids 119-123 

N-myristoylation sites: 

amino acids 103-109,150-156,160-166,161-167,175-181 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 signature 1: 

amino acids 136-156 



Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 signature 2: 

amino acids 166-178 
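
The N-glycosylation site listed above for FIGURE 262 can be cross-checked against the sequence. The following is a minimal sketch, assuming the site was called with the common PROSITE-style pattern N-{P}-[ST]-{P} (the figure does not state which pattern was used) and that the sequence is the one transcribed in FIGURE 262. The function name `n_glyc_sites` is hypothetical.

```python
import re

# Amino acid sequence transcribed from FIGURE 262 (line breaks removed).
SEQ = (
    "MALKNKFSCLWILGLCLVATTSSKIPSITDPHFIDNCIEAHNEWRGKVNPPAADMKYMIWDKG"
    "LAKMAKAWANQCKFEHNDCLDKSYKCYAAFEYVGENIWLGGIKSFTPRHAITAWYNETQFYDF"
    "DSLSCSRVCGHYTQLVWANSFYVGCAVAMCPNLGGASTAIFVCNYGPAGNFANMPPYARGESC"
    "SLCSKEEKCVKNLCRTPQLIIPNQNPFLKPTGRAPQQTAFNPFSLGFLLLRIF"
)

# Assumed pattern: Asn, any residue except Pro, Ser or Thr, any residue except Pro.
# The lookahead lets overlapping candidate sites be reported as well.
N_GLYC = re.compile(r"(?=N[^P][ST][^P])")


def n_glyc_sites(seq: str) -> list[int]:
    """Return 1-based start positions of candidate N-glycosylation sites."""
    return [m.start() + 1 for m in N_GLYC.finditer(seq)]


print(n_glyc_sites(SEQ))  # prints [119]
```

The single hit at position 119 (residues NETQ) agrees with the start of the range given in the figure; the figure's convention of reporting a five-residue range is kept as-is in the listing.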
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FIGURE 263 

CGCCCTCCGACCCGCCCCGCGGCGCATTGTGGGATCTGTCGGCTTGTCAGGTGGTGGAGGAAA 

AGGCGCTCCGTCATGGGGATCCAGACGAGCCCCGTCCTGCTGGCCTCCCTGGGGGTGGGGCTG 

GTCACTCTGCTCGGCCTGGCTGTGGGCTCCTACTTGGTTCGGAGGTCCCGCCGGCCTCAGGTC 

ACTCTCCTGGACCCCAATGAAAAGTACCTGCTACGACTGCTAGACAAGACGACTGTGAGCCAC 

AACACCAAGAGGTTCCGCTTTGCCCTGCCCACCGCCCACCACACTCTGGGGCTGCCTGTGGGC 

AAACATATCTACCTCTCCACCCGAATTGATGGCAGCCTGGTCATCAGGCCATACACTCCTGTC 

ACCAGTGATGAGGATCAAGGCTATGTGGATCTTGTCATCAAGGTCTACCTGAAGGGTGTGCAC 

CCCAAATTTCCTGAGGGAGGGAAGATGTCTCAGTACCTGGATAGCCTGAAGGTTGGGGATGTG 

GTGGAGTTTCGGGGGCCAAGCGGGTTGCTCACTTACACTGGAAAAGGGCATTTTAACATTCAG 

CCCAACAAGAAATCTCCACCAGAACCCCGAGTGGCGAAGAAACTGGGAATGATTGCCGGCGGG 

ACAGGAATCACCCCAATGCTACAGCTGATCCGGGCCATCCTGAAAGTCCCTGAAGATCCAACC 

CAGTGCTTTCTGCTTTTTGCCAACCAGACAGAAAAGGATATCATCTTGCGGGAGGACTTAGAG 

GAACTGCAGGCCCGCTATCCCAATCGCTTTAAGCTCTGGTTCACTCTGGATCATCCCCCAAAA 

GATTGGGCCTACAGCAAGGGCTTTGTGACTGCCGACATGATCCGGGAACACCTGCCCGCTCCA 

GGGGATGATGTGCTGGTACTGCTTTGTGGGCCACCCCCAATGGTGCAGCTGGCCTGCCATCCC 

AACTTGGACAAACTGGGCTACTCACAAAAGATGCGATTCACCTACTGAGCATCCTCCAGCTTC 

CCTGGTGCTGTTCGCTGCAGTTGTTCCCCATCAGTACTCAAGCACTATAAGCCTTAGATTCCT 

TTCCTCAGAGTTTCAGGTTTTTTCAGTTACATCTAGAGCTGAAATCTGGATAGTACCTGCAGG 

AACAATATTCCTGTAGCCATGGAAGAGGGCAAGGCTCAGTCACTCCTTGGATGGCCTCCTAAA 

TCTCCCCGTGGCAACAGGTCCAGGAGAGGCCCATGGAGCAGTCTCTTCCATGGAGTAAGAAGG 

AAGGGAGCATGTACGCTTGGTCCAAGATTGGCTAGTTCCTTGATAGCATCTTACTCTCACCTT 

CTTTGTGTCTGTGATGAAAGGAACAGTCTGTGCAATGGGTTTTACTTAAACTTCACTGTTCAA 

CCTATGAGCAAATCTGTATGTGTGAGTATAAGTTGAGCATAGCATACTTCCAGAGGTGGTNTT 

ATGGAGATGGCAAGAAAGGAGGAAATGATTTCTTCAGATNTCAAAGGAGTCTGAAATATCATA 

TTTCTGTGTGTGTCTCTCTCAGCCCCTGCCCAGGCTAGAGGGAAACAGCTACTGATAATCGAA 

AACTGCTGTTTGTGGCANGAACCCCTGGCTGTGCAAATAAATGGGGCTGAGGCCCCTGTGTGA 

TATTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGA 
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FIGURE 264 

MGIQTSPVLLASLGVGLVTLLGLAVGSYLVRRSRRPQVTLLDPNEKYLLRLLDKTTVSHNTKR 
FRFALPTAHHTLGLPVGKHIYLSTRIDGSLVIRPYTPVTSDEDQGYVDLVIKVYLKGVHPKFP 
EGGKMSQYLDSLKVGDVVEFRGPSGLLTYTGKGHFNIQPNKKSPPEPRVAKKLGMIAGGTGIT 
PMLQLIRAILKVPEDPTQCFLLFANQTEKDIILREDLEELQARYPNRFKLWFTLDHPPKDWAY 
SKGFVTADMIREHLPAPGDDVLVLLCGPPPMVQLACHPNLDKLGYSQKMRFTY 

Important features: 
Signal peptide: 

amino acids 1-26 

N-glycosylation site: 

amino acids 214-218 

N-myristoylation sites: 

amino acids 22-28, 76-82, 128-134, 180-186
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FIGURE 265 

CCCGTGCCAAGAGTGACGTAAGTACCGCCTATAGAGTCTATAGGCCCACTTGGCTTCGTTAGA 

ACGCGGCTACAATTAATACATAACCTTATGTATCATACACATACGATTTAGGTGACACTATAG 

AATAACATCCACTTTGCCTTTCTCTCCACAGGTGTCCACTCCCAGGTCCAACTGCACCTCGGT 

TCTATCGATAATCTCAGCACCAGCCACTCAGAGCAGGGCACGATGTTGGGGGCCCGCCTCAGG 

CTCTGGGTCTGTGCCTTGTGCAGCGTCTGCAGCATGAGCGTCCTCAGAGCCTATCCCAATGCC 

TCCCCACTGCTCGGCTCCAGCTGGGGTGGCCTGATCCACCTGTACACAGCCACAGCCAGGAAC 

AGCTACCACCTGCAGATCCACAAGAATGGCCATGTGGATGGCGCACCCCATCAGACCATCTAC 

AGTGCCCTGATGATCAGATCAGAGGATGCTGGCTTTGTGGTGATTACAGGTGTGATGAGCAGA 

AGATACCTCTGCATGGATTTCAGAGGCAACATTTTTGGATCACACTATTTCGACCCGGAGAAC 

TGCAGGTTCCAACACCAGACGCTGGAAAACGGGTACGACGTCTACCACTCTCCTCAGTATCAC 

TTCCTGGTCAGTCTGGGCCGGGCGAAGAGAGCCTTCCTGCCAGGCATGAACCCACCCCCGTAC 

TCCCAGTTCCTGTCCCGGAGGAACGAGATCCCCCTAATTCACTTCAACACCCCCATACCACGG 

CGGCACACCCGGAGCGCCGAGGACGACTCGGAGCGGGACCCCCTGAACGTGCTGAAGCCCCGG

GCCCGGATGACCCCGGCCCCGGCCTCCTGTTCACAGGAGCTCCCGAGCGCCGAGGACAACAGC 

CCGATGGCCAGTGACCCATTAGGGGTGGTCAGGGGCGGTCGAGTGAACACGCACGCTGGGGGA 

ACGGGCCCGGAAGGCTGCCGCCCCTTCGCCAAGTTCATCTAGGGTCGCTGG 
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FIGURE 266 

MLGARLRLWVCALCSVCSMSVLRAYPNASPLLGSSWGGLIHLYTATARNSYHLQIHKNGHVDG 
APHQTIYSALMIRSEDAGFVVITGVMSRRYLCMDFRGNIFGSHYFDPENCRFQHQTLENGYDV 
YHSPQYHFLVSLGRAKRAFLPGMNPPPYSQFLSRRNEIPLIHFNTPIPRRHTRSAEDDSERDP 
LNVLKPRARMTPAPASCSQELPSAEDNSPMASDPLGVVRGGRVNTHAGGTGPEGCRPFAKFI 

Important features:
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 175-179 

N-myristoylation site. 

amino acids 33-39, 100-106, 225-231, 229-235 

HBGF/FGF family proteins 

amino acids 73-124 
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FIGURE 267 

GGCTGAGGGGAGGCCCGGAGCCTTTCTGGGGCCTGGGGGATCCTCTTGCACTGGTGGGTGGAGAGAAGCGCCTGC 

AGCCAACCAGGGTCAGGCTGTGCTCACAGTTTCCTCTGGCGGCATGTAAAGGCTCCACAAAGGAGTTGGGAGTTC 

AAATGAGGCTGCTGCGGACGGCCTGAGGATGGACCCCAAGCCCTGGACCTGCCGAGCGTGGCACTGAGGCAGCGG 

CTGACGCTACTGTGAGGGAAAGAAGGTTGTGAGCAGCCCCGCAGGACCCCTGGCCAGCCCTGGCCCCAGCCTCTG 

CCGGAGCCCTCTGTGGAGGCAGAGCCAGTGGAGCCCAGTGAGGCAGGGCTGCTTGGCAGCCACCGGCCTGCAACT 

CAGGAACCCCTCCAGAGGCCATGGACAGGCTGCCCCGCTGACGGCCAGGGTGAAGCATGTGAGGAGCCGCCCCGG

AGCCAAGCAGGAGGGAAGAGGCTTTCATAGATTCTATTCACAAAGAATAACCACCATTTTGCAAGGACCATGAGG 

CCACTGTGCGTGACATGCTGGTGGCTCGGACTGCTGGCTGCCATGGGAGCTGTTGCAGGCCAGGAGGACGGTTTT 

GAGGGCACTGAGGAGGGCTCGCCAAGAGAGTTCATTTACCTAAACAGGTACAAGCGGGCGGGCGAGTCCCAGGAC 

AAGTGCACCTACACCTTCATTGTGCCCCAGCAGCGGGTCACGGGTGCCATCTGCGTCAACTCCAAGGAGCCTGAG 

GTGCTTCTGGAGAACCGAGTGCATAAGCAGGAGCTAGAGCTGCTCAACAATGAGCTGCTCAAGCAGAAGCGGCAG

ATCGAGACGCTGCAGCAGCTGGTGGAGGTGGACGGCGGCATTGTGAGCGAGGTGAAGCTGCTGCGCAAGGAGAGC 

CGCAACATGAACTCGCGGGTCACGCAGCTCTACATGCAGCTCCTGCACGAGATCATCCGCAAGCGGGACAACGCG 

TTGGAGCTCTCCCAGCTGGAGAACAGGATCCTGAACCAGACAGCCGACATGCTGCAGCTGGCCAGCAAGTACAAG 

GAGCTGGAGCACAAGTACCAGCACCTGGCCACACTGGCCCACAACCAATCAGAGATCATCGCGCAGCTTGAGGAG 

CACTGCCAGAGGGTGCCCTCGGCCAGGCCCGTCCCCCAGCCACCCCCCGCTGCCCCGCCCCGGGTCTACCAACCA 

CCCACCTACAACCGCATCATCAACCAGATCTCTACCAACGAGATCCAGAGTGACCAGAACCTGAAGGTGCTGCCA 

CCCCCTCTGCCCACTATGCCCACTCTCACCAGCCTCCCATCTTCCACCGACAAGCCGTCGGGCCCATGGAGAGAC 

TGCCTGCAGGCCCTGGAGGATGGCCACGACACCAGCTCCATCTACCTGGTGAAGCCGGAGAACACCAACCGCCTC 

ATGCAGGTGTGGTGCGACCAGAGACACGACCCCGGGGGCTGGACCGTCATCCAGAGACGCCTGGATGGCTCTGTT 

AACTTCTTCAGGAACTGGGAGACGTACAAGCAAGGGTTTGGGAACATTGACGGCGAATACTGGCTGGGCCTGGAG 

AACATTTACTGGCTGACGAACCAAGGCAACTACAAACTCCTGGTGACCATGGAGGACTGGTCCGGCCGCAAAGTC 

TTTGCAGAATACGCCAGTTTCCGCCTGGAACCTGAGAGCGAGTATTATAAGCTGCGGCTGGGGCGCTACCATGGC 
AATGCGGGTGACTCCTTTACATGGCACAACGGCAAGCAGTTCACCACCCTGGACAGAGATCATGATGTCTACACA 
GGAAACTGTGCCCACTACCAGAAGGGAGGCTGGTGGTATAACGCCTGTGCCCACTCCAACCTCAACGGGGTCTGG 
TACCGCGGGGGCCATTACCGGAGCCGCTACCAGGACGGAGTCTACTGGGCTGAGTTCCGAGGAGGCTCTTACTCA 
CTCAAGAAAGTGGTGATGATGATCCGACCGAACCCCAACACCTTCCACTAAGCCAGCTCCCCCTCCTGACCTCTC 
GTGGCCATTGCCAGGAGCCCACCCTGGTCACGCTGGCCACAGCACAAAGAACAACTCCTCACCAGTTCATCCTGA 
GGCTGGGAGGACCGGGATGCTGGATTCTGTTTTCCGAAGTCACTGCAGCGGATGATGGAACTGAATCGATACGGT 
GTTTTCTGTCCCTCCTACTTTCCTTCACACCAGACAGCCCCTCATGTCTCCAGGACAGGACAGGACTACAGACAA 
CTCTTTCTTTAAATAAATTAAGTCTCTACAATAAAAAAAA 
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FIGURE 268 

MRPLCVTCWWLGLLAAMGAVAGQEDGFEGTEEGSPREFIYLNRYKRAGESQDKCTYTFIVPQQ 
RVTGAICVNSKEPEVLLENRVHKQELELLNNELLKQKRQIETLQQLVEVDGGIVSEVKLLRKE 
SRNMNSRVTQLYMQLLHEIIRKRDNALELSQLENRILNQTADMLQLASKYKDLEHKYQHLATL
AHNQSEIIAQLEEHCQRVPSARPVPQPPPAAPPRVYQPPTYNRIINQISTNEIQSDQNLKVLP 
PPLPTMPTLTSLPSSTDKPSGPWRDCLQALEDGHDTSSIYLVKPENTNRLMQVWCDQRHDPGG 
WTVIQRRLDGSVNFFRNWETYKQGFGNIDGEYWLGLENIYWLTNQGNYKLLVTMEDWSGRKVF 
AEYASFRLEPESEYYKLRLGRYHGNAGDSFTWHNGKQFTTLDRDHDVYTGNCAHYQKGGWWYN 
ACAHSNLNGVWYRGGHYRSRYQDGVYWAEFRGGSYSLKKVVMMIRPNPNTFH

Important features: 

Signal peptide: 

amino acids 1-22 

N-glycosylation sites: 

amino acids 164-168,192-196 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

amino acids 124-128 

Tyrosine kinase phosphorylation sites: 

amino acids 177-184,385-393,385-394,461-468 

N-myristoylation sites: 

amino acids 12-18,18-24,22-28,29-35,114-120,341-347,465-471,473-479

Amidation site: 

amino acids 373-377 

Fibrinogen beta and gamma chains C-terminal domain signature: 

amino acids 438-451 

Fibrinogen beta and gamma chains C-terminal domain proteins: 

amino acids 305-343, 365-402, 411-424, 428-458 



Trehalase proteins: 

amino acids 275-292 




FIGURE 269

GCCGAGCTGAGCGGATCCTCACATGACTGTGATCCGATTCTTTCCAGCGGCTTCTGCAACCAA

GCGGGTCTTACCCCCGGTCCTCCGCGTCTCCAGTCCTCGCACCTGGAACCCCAACGTCCCCGA 

GAGTCCCCGAATCCCCGCTCCCAGGCTACCTAAGAGGATGAGCGGTGCTCCGACGGCCGGGGC 

AGCCCTGATGCTCTGCGCCGCCACCGCCGTGCTACTGAGCGCTCAGGGCGGACCCGTGCAGTC 

CAAGTCGCCGCGCTTTGCGTCCTGGGACGAGATGAATGTCCTGGCGCACGGACTCCTGCAGCT 

CGGCCAGGGGCTGCGCGAACACGCGGAGCGCACCCGCAGTCAGCTGAGCGCGCTGGAGCGGCG 

CCTGAGCGCGTGCGGGTCCGCCTGTCAGGGAACCGAGGGGTCCACCGACCTCCCGTTAGCCCC 

TGAGAGCCGGGTGGACCCTGAGGTCCTTCACAGCCTGCAGACACAACTCAAGGCTCAGAACAG 

CAGGATCCAGCAACTCTTCCACAAGGTGGCCCAGCAGCAGCGGCACCTGGAGAAGCAGCACCT 

GCGAATTCAGCATCTGCAAAGCCAGTTTGGCCTCCTGGACCACAAGCACCTAGACCATGAGGT 

GGCCAAGCCTGCCCGAAGAAAGAGGCTGCCCGAGATGGCCCAGCCAGTTGACCCGGCTCACAA 

TGTCAGCCGCCTGCACCGGCTGCCCAGGGATTGCCAGGAGCTGTTCCAGGTTGGGGAGAGGCA 

GAGTGGACTATTTGAAATCCAGCCTCAGGGGTCTCCGCCATTTTTGGTGAACTGCAAGATGAC 

CTCAGATGGAGGCTGGACAGTAATTCAGAGGCGCCACGATGGCTCAGTGGACTTCAACCGGCC 

CTGGGAAGCCTACAAGGCGGGGTTTGGGGATCCCCACGGCGAGTTCTGGCTGGGTCTGGAGAA 

GGTGCATAGCATCACGGGGGACCGCAACAGCCGCCTGGCCGTGCAGCTGCGGGACTGGGATGG 

CAACGCCGAGTTGCTGCAGTTCTCCGTGCACCTGGGTGGCGAGGACACGGCCTATAGCCTGCA 

GCTCACTGCACCCGTGGCCGGCCAGCTGGGCGCCACCACCGTCCCACCCAGCGGCCTCTCCGT 

ACCCTTCTCCACTTGGGACCAGGATCACGACCTCCGCAGGGACAAGAACTGCGCCAAGAGCCT 

CTCTGGAGGCTGGTGGTTTGGCACCTGCAGCCATTCCAACCTCAACGGCCAGTACTTCCGCTC

CATCCCACAGCAGCGGCAGAAGCTTAAGAAGGGAATCTTCTGGAAGACCTGGCGGGGCCGCTA 

CTACCCGCTGCAGGCCACCACCATGTTGATCCAGCCCATGGCAGCAGAGGCAGCCTCCT&SCG 

TCCTGGCTGGGCCTGGTCCCAGGCCCACGAAAGACGGTGACTCTTGGCTCTGCCCGAGGATGT 

GGCCGTTCCCTGCCTGGGCAGGGGCTCCAAGGAGGGGCCATCTGGAAACTTGTGGACAGAGAA 

GAAGACCACGACTGGAGAAGCCCCCTTTCTGAGTGCAGGGGGGCTGCATGCGTTGCCTCCTGA 

GATCGAGGCTGCAGGATATGCTCAGACTCTAGAGGCGTGGACCAAGGGGCATGGAGCTTCACT 

CCTTGCTGGCCAGGGAGTTGGGGACTCAGAGGGACCACTTGGGGCCAGCCAGACTGGCCTCAA 

TGGCGGACTCAGTCACATTGACTGACGGGGACCAGGGCTTGTGTGGGTCGAGAGCGCCCTCAT 

GGTGCTGGTGCTGTTGTGTGTAGGTCCCCTGGGGACACAAGCAGGCGCCAATGGTATCTGGGC 

GGAGCTCACAGAGTTCTTGGAATAAAAGCAACCTCAGAACAC 




FIGURE 270 

MTVIRFFPAASATKRVLPPVLRVSSPRTWNPNVPESPRIPAPRLPKRMSGAPTAGAALMLCAA 
TAVLLSAQGGPVQSKSPRFASWDEMNVLAHGLLQLGQGLREHAERTRSQLSALERRLSACGSA 
CQGTEGSTDLPLAPESRVDPEVLHSLQTQLKAQNSRIQQLFHKVAQQQRHLEKQHLRIQHLQS 
QFGLLDHKHLDHEVAKPARRKRLPEMAQPVDPAHNVSRLHRLPRDCQELFQVGERQSGLFEIQ 
PQGSPPFLVNCKMTSDGGWTVIQRRHDGSVDFNRPWEAYKAGFGDPHGEFWLGLEKVHSITGD 
RNSRLAVQLRDWDGNAELLQFSVHLGGEDTAYSLQLTAPVAGQLGATTVPPSGLSVPFSTWDQ 
DHDLRRDKNCAKSLSGGWWFGTCSHSNLNGQYFRSIPQQRQKLKKGIFWKTWRGRYYPLQATT 
MLIQPMAAEAAS

Important features: 
Signal peptide: 

Amino acids 1-13 

Transmembrane domain: 

Amino acids 53-70 

N-glycosylation site: 

Amino acids 224-228 

cAMP- and cGMP-dependent protein kinase phosphorylation sites: 

Amino acids 46-50; 118-122 

N-myristoylation sites: 

Amino acids 50-56; 129-135; 341-347; 357-363 

Fibrinogen beta and gamma chains C-terminal domain signature:

Amino acids 396-409
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FIGURE 271 

CGGACGCGTGGGGGAAACCCTTCCGAGAAAACAGCAACAAGCTGAGCTGCTGTGACAGAGGGG 

AACAAGATGGCGGCGCCGAAGGGGAGCCTCTGGGTGAGGACCCAACTGGGGCTCCCGCCGCTG 

CTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCGCTTCGGCTGAAGCATTTGACTCG 

GTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACACCTAC 

CCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTCAGTTT 

GTGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACAGAAGCA 

TATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCCATTCGCT 

GAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTCCTCTAACT 

CTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGCACAGAGCTTCATAACCTCTTCATGG 

ACTTTTTATCTTCAAGCCGATGACGGAAAAATAGTTATATTCCAGTCTAAGCCAGAAATCCAG 

TACGCACCACATTTGGAGCAGGAGCCTACAAATTTGAGAGAATCATCTCTAAGCAAAATGTCC 

TATCTGCAAATGAGAAATTCACAAGCGCACAGGAATTTTCTTGAAGATGGAGAAAGTGATGGC 

TTTTTAAGATGCCTCTCTCTTAACTCTGGGTGGATTTTAACTACAACTCTTGTCCTCTCGGTG

ATGGTATTGCTTTGGATTTGTTGTGCAACTGTTGCTACAGCTGTGGAGCAGTATGTTCCCTCT 

GAGAAGCTGAGTATCTATGGTGACTTGGAGTTTATGAATGAACAAAAGCTAAACAGATATCCA 

GCTTCTTCTCTTGTGGTTGTTAGATCTAAAACTGAAGATCATGAAGAAGCAGGGCCTCTACCT 

ACAAAAGTGAATCTTGCTCATTCTGAAATTT^AGCATTTTTCTTTTAAAAGACAAGTGTAATA 

GACATCTAAAATTCCACTCCTCATAGAGCTTTTAAAATGGTTTCATTGGATATAGGCCTTAAG 

AAATCACTATAAAATGCAAATAAAGTTACTCAAATCTGTG 
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FIGURE 272 

MAAPKGSLWVRTQLGLPPLLLLTMALAGGSGTASAEAFDSVLGDTASCHRACQLTYPLHTYPK 
EEELYACQRGCRLFSICQFVDDGIDLNRTKLECESACTEAYSQSDEQYACHLGCQNQLPFAEL 
RQEQLMSLMPKMHLLFPLTLVRSFWSDMMDSAQSFITSSWTFYLQADDGKIVIFQSKPEIQYA 
PHLEQEPTNLRESSLSKMSYLQMRNSQAHRNFLEDGESDGFLRCLSLNSGWILTTTLVLSVMV 
LLWICCATVATAVEQYVPSEKLSIYGDLEFMNEQKLNRYPASSLVVVRSKTEDHEEAGPLPTK 

VNLAHSEI 

Important features: 
Signal peptide: 

amino acids 1-31 

Transmembrane domain: 

amino acids 241-260 

N-glycosylation site: 

amino acids 90-94 

N-myristoylation sites: 

amino acids 28-34,29-35,31-37,86-92 
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FIGURE 273 

CCCACGCGTCCGAACCTCTCCAGCGATGGGAGCCGCCCGCCTGCTGCCCAACCTCACTCTGTG 
CTTACAGCTGCTGATTCTCTGCTGTCAAACTCAGTACGTGAGGGACCAGGGCGCCATGACCGA 
CCAGCTGAGCAGGCGGCAGATCCGCGAGTACCAACTCTACAGCAGGACCAGTGGCAAGCACGT 
GCAGGTCACCGGGCGTCGCATCTCCGCCACCGCCGAGGACGGCAACAAGTTTGCCAAGCTCAT 
AGTGGAGACGGACACGTTTGGCAGCCGGGTTCGCATCAAAGGGGCTGAGAGTGAGAAGTACAT 
CTGTATGAACAAGAGGGGCAAGCTCATCGGGAAGCCCAGCGGGAAGAGCAAAGACTGCGTGTT 
CACGGAGATCGTGCTGGAGAACAACTATACGGCCTTCCAGAACGCCCGGCACGAGGGCTGGTT 
CATGGCCTTCACGCGGCAGGGGCGGCCCCGCCAGGCTTCCCGCAGCCGCCAGAACCAGCGCGA 
GGCCCACTTCATCAAGCGCCTCTACCAAGGCCAGCTGCCCTTCCCCAACCACGCCGAGAAGCA 
GAAGCAGTTCGAGTTTGTGGGCTCCGCCCCCACCCGCCGGACCAAGCGCACACGGCGGCCCCA 
GCCCCTCACGTAGGTCTGGGAGGCAGGGGGCAGCAGCCCCTGGGCCGCCTCCCCACCCCTTTCC
CTTCTTAATCCAAGGACTGGGCTGGGGTGGCGGGAGGGGAGCCAGATCCCCGAGGGAGGACCC 
TGAGGGCCGCGAAGCATCCGAGCCCCCAGCTGGGAAGGGGCAGGCCGGTGCCCCAGGGGCGGC 
TGGCACAGTGCCCCCTTCCCGGACGGGTGGCAGGCCCTGGAGAGGAACTGAGTGTCACCCTGA 
TCTCAGGCCACCAGCCTCTGCCGGCCTCCCAGCCGGGCTCCTGAAGCCCGCTGAAAGGTCAGC 
GACTGAAGGCCTTGCAGACAACCGTCTGGAGGTGGCTGTCCTCAAAATCTGCTTCTCGGATCT 
CCCTCAGTCTGCCCCCAGCCCCCAAACTCCTCCTGGCTAGACTGTAGGAAGGGACTTTTGTTT 
GTTTGTTTGTTTCAGGAAAAAAGAAAGGGAGAGAGAGGAAAATAGAGGGTTGTCCACTCCTCA 
CATTCCACGACCCAGGCCTGCACCCCACCCCCAACTCCCAGCCCCGGAATAAAACCATTTTCC 

TGC 
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FIGURE 274 

MGAARLLPNLTLCLQLLILCCQTQYVRDQGAMTDQLSRRQIREYQLYSRTSGKHVQVTGRRIS 
ATAEDGNKFAKLIVETDTFGSRVRIKGAESEKYICMNKRGKLIGKPSGKSKDCVFTEIVLENN 
YTAFQNARHEGWFMAFTRQGRPRQASRSRQNQREAHFIKRLYQGQLPFPNHAEKQKQFEFVGS 
APTRRTKRTRRPQPLT 

Important features: 
Signal peptide: 

Amino acids 1-22 

N-glycosylation site. 

amino acids 9-13, 126-130 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 60-64 

Casein kinase II phosphorylation site. 

amino acids 65-69 

Tyrosine kinase phosphorylation site. 

amino acids 39-48, 89-97 

N-myristoylation site.

amino acids 69-75, 188-194 

Amidation site. 

amino acids 58-62 

HBGF/FGF family signature. 

amino acids 103-128 
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FIGURE 275 

TATTTACCATATCAGATTCACATTCAGTCCTCAGCAAAATGAAGGGCTCCATTTTCACTCTGT 
TTTTATTCTCTGTCCTATTTGCCATCTCAGAAGTGCGGAGCAAGGAGTCTGTGAGACTCTGTG 
GGCTAGAATACATACGGACAGTCATCTATATCTGTGCTAGCTCCAGGTGGAGAAGGCATCTGG 
AGGGGATCCCTCAAGCTCAGCAAGCTGAGACAGGAAACTCCTTCCAGCTCCCACATAAACGTG 
AGTTTTCTGAGGAAAATCCAGCGCAAAACCTTCCGAAGGTGGATGCCTCAGGGGAAGACCGTC 
TTTGGGGTGGACAGATGCCCACTGAAGAGCTTTGGAAGTCAAAGAAGCATTCAGTGATGTCAA 
GACAAGATTTACAAACTTTGTGTTGCACTGATGGCTGTTCCATGACTGATTTGAGTGCTCTTT 
GCTAAGACAAGAGCAAATACCCAATGGGTGGCAGAGCTTTATCACATGTTTAATTACAGTGTT 
TTACTGCCTGGTAGAACACTAATATTGTGTTATTAAAATGATGGCTTTTGGGTAGGCAAAACT 
TCTTTTCTAAAAGGTATAGCTGAGCGGTTGAAACCACAGTGATCTCTATTTTCTCCCTTTGCC 
AAGGTTAATGAACTGTTCTTTTCAAATTCTACTAATGCTTTGAAATTTCAAATGCTGCGCAAA 

ATTGCAATAAAAATGCTATAAA 
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FIGURE 276 

MKGSIFTLFLFSVLFAISEVRSKESVRLCGLEYIRTVIYICASSRWRRHLEGIPQAQQAETGN 
SFQLPHKREFSEENPAQNLPKVDASGEDRLWGGQMPTEELWKSKKHSVMSRQDLQTLCCTDGC 

SMTDLSALC 

Important features: 
Signal sequence: 

amino acids 1-18 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

amino acids 107-111 

N-myristoylation sites: 

amino acids 3-9,52-58,96-102,125-131 

Insulin family signature: 

amino acids 121-136 



Insulin family proteins: 

amino acids 28-46
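
The nucleotide and amino acid figures come in pairs, so each protein figure can be checked by translating its nucleotide figure. The following is a minimal sketch, assuming the standard genetic code and assuming the coding region begins at the first ATG in the sequence; neither assumption is stated in the figures, and the first-ATG rule will not hold for every sequence. The names `CODON_TABLE` and `translate_from_first_atg` are hypothetical, and the input is only the first line of FIGURE 275.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First line of the nucleotide sequence in FIGURE 275 (excerpt only).
DNA_EXCERPT = "TATTTACCATATCAGATTCACATTCAGTCCTCAGCAAAATGAAGGGCTCCATTTTCACTCTGT"


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of input.

    Assumes the input contains only A, C, G and T; ambiguity codes such as N
    are not handled in this sketch.
    """
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for pos in range(start, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


print(translate_from_first_atg(DNA_EXCERPT))  # prints MKGSIFTL
```

The result matches the first eight residues of FIGURE 276. Running the same check over a whole figure pair is a quick way to spot transcription errors in either sequence.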
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FIGURE 277 

GCAGCTGGTTACTGCATTTCTCCATGTGGCAGACAGAGCAAAGCCACAACGCTTTCTCTGCTGGATTAAAGACGG 
CCCACAGACCAGAACTTCCACTATACTACTTAAAATTACATAGGTGGCTTGTCAAATTCAATTGATTAGTATTGT 
AAAAGGAAAAAGAAGTTCCTTCTTACAGCTTGGATTCAACGGTCCAAAACAAAAATGCAGCTGCCATTAAAGTCT 
CAGATGAACAAACTTCTACACTGATTTTTAAAATCAAGAATAAGGGCAGCAAGTTTCTGGATTCACTGAATCAAC 
AGAC^CAAAAAGCTGGCAATATAGCAACTATGAAGAGAAAAGCTACTAATAAAATTAACCCAACGCATAGAAGAC 
TTTTTTTTCTCTTCTAAAAACAACTAAGTAAAGACTTAAATTTAAACACATCATTTTACAACCTCATTTCAAAAT 
GAAGACTTTTACCTGGACCCTAGGTGTGCTATTCTTCCTACTAGTGGACACTGGACATTGCAGAGGTGGACAATT 
CAAAATTAAAAAAATAAACCAGAGAAGATACCCTCGTGCCACAGATGGTAAAGAGGAAGCAAAGAAATGTGCATA 
CACATTCCTGGTACCTGAACAAAGAATAACAGGGCCAATCTGTGTCAACACCAAGGGGCAAGATGCAAGTACCAT 
TRAAGACATGATCACCAGGATGGACCTTGAAAACCTGAAGGATGTGCTCTCCAGGCAGAAGCGGGAGATAGATGT 
TCTGCAACTGGTGGTGGATGTAGATGGAAACATTGTGAATGAGGTAAAGCTGCTGAGAAAGGAAAGCCGTAACAT 
GAACTCTCGTGTTACTCAACTCTATATGCAATTATTACATGAGATTATCCGTAAGAGGGATAATTCACTTGAACT 
TTCCCAACTGGAAAACAAAATCCTCAATGTCACCACAGAAATGTTGAAGATGGCAACAAGATACAGGGAACTAGA 
GGTGAAATACGCTTCCTTGACTGATCTTGTCAATAACCAATCTGTGATGATCACTTTGTTGGAAGAACAGTGCTT 
GAGGATATTTTCCCGACAAGACACCCATGTGTCTCCCCCACTTGTCCAGGTGGTGCCACAACATATTCCTAACAG 
CCAACAGTATACTCCTGGTCTGCTGGGAGGTAACGAGATTCAGAGGGATCCAGGTTATCCCAGAGATTTAATGCC 

ACctcCTGATCTGGCAACTTCTCCCACCAAA^ 

ATTCAAAGACTGTCAGCAAGCAAAAGAAGCTGGGCATTCGGTCAGTGGGATTTATATGATTAAA^^ 
CAATGGACCAATGCAGTTATGGTGTGAAAACAGTTTGGACCCTGGGGGTTGGACTGTTATTCAGAAAAGAACAGA 
rrGCTCTGTCAACTTCTTCAGAAATTGGGAAAATTATAAGAAAGGGTTTGGAAACATTGACGGAGAATACTGGCT 
TGGACTGGAAAATATCTATATGCTTAGCAATCAAGATAATTACAAGTTATTGATTGAATT^ 

taaSgtctatgcagaatacagcagctttcgtctggaacctgaaagtgaattctatagactgcgcctgggaa^ 

TTACCAGGGAAATGCAGGGGATTCTATGATGTGGCATAATGGTAAACAATTCACCACACTGGACAGAGATAAAGA 
TATGTATGCAGGAAACTGCGCCCACTTTCATAAAGGAGGCTGGTGGTACAATGCCTGTGCACATTCTAACCTAAA 
TGGAGTATGGTACAGAGGAGGCCATTACAGAAGCAAGCACCAAGATGGAATTTTCTGGGCCGAATACAGAGGCGG 
GTCATACTCCTTAAGAGCAGTTCAGATGATGATCAAGCCTATTGACT^AGAGAGACACTCGCCAATTTAAATGA 
CACAGAACTTTGTACTTTTCAGCTCTTAAAAATGTAAATGTTACATGTATATTACTTGGCACAATTTATTTCTAC 
ACAGAAAGTTTTTAAAATGAATTTTACCGTAACTATAAAAGGGAACCTATAAATGTAGTTTCATCTGTCGTCAAT 

?actg^gaaa!t^tgtatccacaacctagttattttaaaaattatgttgactaaa^^ 

TAAAATGTAAATATTTGCCACAATGTAAAGCAAATCTTAGCTATATTTTAAATCATAAATAACATGTTCAAGATA 
CTTAACAATTTATTTAAAATCTAAGATTGCTCTAACGTCTAGTGAAAAAAATATTTTTTAAATTTCAGCCAAATA 
ATGCAT^TATTTTATAAAAATACAGACAGAAAATTAGGGAGAAACTTCTAGTTTTGCCAATAGAAAATGTTCTT 
CCATTGAATAAAAGTTATTTCAAATTGAATTTGTGCCTTTCACACGTAATGATTAAATCTGAATTCTTAATAATA 
TATCCTATGCTGATTTTCCCAAAACATGACCCATAGTATTAAATACATATCATTTTTAAAAATAAAAAAAAACCC 
AA^TAATGCATGCATAATTTAAATGGTCAATTTATAAAGACAAATCTATGAATGAATTTTTCAGTGTTATCTT 

C^GMATG?TGAACA?CAAAATCTCC 

ACAAAAATGCAGAATATTTAGTGTGCTACAGATCTGAATTATAGTTCTAATTTATTATTACTTTTTTTCTAATTT 
AC^TTACTACTACAAAGAAAAAAAAACCCAACCCATCTGCA^^ 

AAGTATTAGTGCATGCTCAGAACAGGTGGGACTAAAACAAACTCAAGGAACTGTTGGCTGTTTTCCCGATA^ 
GAATTCAACAGCTCCAGAGCAGAAGCCACAGGGGCATAGCTTAGTCCAAAC 

GTAACGCTTAGTCTCACAGTGTCTTTAACTCATCTTTGCAATCAACAACTTTACTAGTGACTTTCTGGAACAATT 
TCCTTTCAGGAATACATATTCACTGCTTAGAGGTGACCTTGCCTTAATATATTTGTGAAGTTAAAATTTTAAAGA 
TAGCTCATGAAACTTTTGCTTAAGCAAAAAGAAAACCTCGAATTGAAATGTGTGAGGCAAACTATGCATGGGAAT 
AGCTTAATGTGAAGATAATCATTTGGACAACTCAAATCCATCAACATGACCAATGTTTTTCATCTGCCACATCTC 
AAAATAAAACTTCTGGTGAAACAAATTAAACAAAATATCCAAACCTCAAAAAAAA 
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FIGURE 278 

MKTFTWTLGVLFFLLVDTGHCRGGQFKIKKINQRRYPRATDGKEEAKKCAYTFLVPEQRITGP 
ICVNTKGQDASTIKDMITRMDLENLKDVLSRQKREIDVLQLVVDVDGNIVNEVKLLRKESRNM 
NSRVTQLYMQLLHEIIRKRDNSLELSQLENKILNVTTEMLKMATRYRELEVKYASLTDLVNNQ 
SVMITLLEEQCLRIFSRQDTHVSPPLVQVVPQHIPNSQQYTPGLLGGNEIQRDPGYPRDLMPP
PDLATSPTKSPFKIPPVTFINEGPFKDCQQAKEAGHSVSGIYMIKPENSNGPMQLWCENSLDP
GGWTVIQKRTDGSVNFFRNWENYKKGFGNIDGEYWLGLENIYMLSNQDNYKLLIELEDWSDKK 
VYAEYSSFRLEPESEFYRLRLGTYQGNAGDSMMWHNGKQFTTLDRDKDMYAGNCAHFHKGGWW 
YNACAHSNLNGVWYRGGHYRSKHQDGIFWAEYRGGSYSLRAVQMMIKPID 

Important features:
Signal sequence:

Amino acids 1-23

N-glycosylation sites:

Amino acids 160-164; 188-192

cAMP- and cGMP-dependent protein kinase phosphorylation site:

Amino acids 120-124

Tyrosine kinase phosphorylation sites:

Amino acids 173-180; 387-396

N-myristoylation sites:

Amino acids 70-76; 110-116; 232-238; 343-349; 400-406; 467-473; 475-487

Fibrinogen beta and gamma chains C-terminal domain signature:

Amino acids 440-453 
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FIGURE 279 

CCCACGCGTCCGCGCAGTCGCGCAGTTCTGCCTCCGCCTGCCAGTCTCGCCCGCGATCCCGGC 

CCGGGGCTGTGGCGTCGACTCCGACCCAGGCAGCCAGCAGCCCGCGCGGGAGCCGGACCGCCG 

CCGGAGGAGCTCGGACGGCATGCTGAGCCCCCTCCTTTGCTGAAGCCCGAGTGCGGAGAAGCC 

CGGGCAAACGCAGGCTAAGGAGACCAAAGCGGCGAAGTCGCGAGACAGCGGACAAGCAGCGGA 

GGAGAAGGAGGAGGAGGCGAACCCAGAGAGGGGCAGCAAAAGAAGCGGTGGTGGTGGGCGTCG 

TGGCCATGGCGGCGGCTATCGCCAGCTCGCTCATCCGTCAGAAGAGGCAAGCCCGCGAGCGCG 

AGAAATCCAACGCCTGCAAGTGTGTCAGCAGCCCCAGCAAAGGCAAGACCAGCTGCGACAAAA 

ACAAGTTAAATGTCTTTTCCCGGGTCAAACTCTTCGGCTCCAAGAAGAGGCGCAGAAGAAGAC 

CAGAGCCTCAGCTTAAGGGTATAGTTACCAAGCTATACAGCCGACAAGGCTACCACTTGCAGC 

TGCAGGCGGATGGAACCATTGATGGCACCAAAGATGAGGACAGCACTTACACTCTGTTTAACC 

TCATCCCTGTGGGTCTGCGAGTGGTGGCTATCCAAGGAGTTCAAACCAAGCTGTACTTGGCAA 

TGAACAGTGAGGGATACTTGTACACCTCGGAACTTTTCACACCTGAGTGCAAATTCAAAGAAT 

CAGTGTTTGAAAATTATTATGTGACATATTCATCAATGATATACCGTCAGCAGCAGTCAGGCC 

GAGGGTGGTATCTGGGTCTGAACAAAGAAGGAGAGATCATGAAAGGCAACCATGTGAAGAAGA 

ACAAGCCTGCAGCTCATTTTCTGCCTAAACCACTGAAAGTGGCCATGTACAAGGAGCCATCAC 

TGCACGATCTCACGGAGTTCTCCCGATCTGGAAGCGGGACCCCAACCAAGAGCAGAAGTGTCT 

CTGGCGTGCTGAACGGAGGCAAATCCATGAGCCACAATGAATCAACGTAGCCAGTGAGGGCAA 

AAGAAGGGCTCTGTAACAGAACCTTACCTCCAGGTGCTGTTGAATTCTTCTAGCAGTCCTTCA 

CCCAAAAGTTCAAATTTGTCAGTGACATTTACCAAACAAACAGGCAGAGTTCACTATTCTATC 

TGCCATTAGACCTTCTTATCATCCATACTAAAGC 
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FIGURE 280 

MAAAIASSLIRQKRQAREREKSNACKCVSSPSKGKTSCDKNKLNVFSRVKLFGSKKRRRRRPE 
PQLKGIVTKLYSRQGYHLQLQADGTIDGTKDEDSTYTLFNLIPVGLRVVAIQGVQTKLYLAMN 
SEGYLYTSELFTPECKFKESVFENYYVTYSSMIYRQQQSGRGWYLGLNKEGEIMKGNHVKKNK 
PAAHFLPKPLKVAMYKEPSLHDLTEFSRSGSGTPTKSRSVSGVLNGGKSMSHNEST 

Important Features: 
N-glycosylation site: 

Amino acids 242-246 

Glycosaminoglycan attachment sites: 

Amino acids 165-169, 218-222 

Tyrosine kinase phosphorylation site: 

Amino acids 93-100 

N-myristoylation sites: 

Amino acids 87-93, 231-237 

ATP/GTP-binding site motif A (P-loop) : 

Amino acids 231-239 



HBGF/FGF family proteins: 

Amino acids 78-94, 102-153 
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FIGURE 281 

CCAGGATGGAGCTGGGGCCTGTATAGCCATATTATTGTTCTATGCTACTAGACATGGGGGGGA 

CTTGGTGAAAAAGGTATTATCCAGCCAGAGGGTCTGGGAGCCCTGTCTTACTGAACCTGGGCA 

ACCTGGATATTCTGAGACATATTTTGGGGGGATTTCAGTGAAAAAAGTGGGGGATCCCCTCCA 

TTTAGAGTGTAGCAAAGGAAAAAACACCAAGGTTGGGTTCCTTCCTGACATTGGCAGTGCCCC 

AGTAGGGGTGGGATGAGCGAATATTCCCAAAGCTAAAGTCCCACACCCTGTAGATTACAAGAG 

TGGATTTGGCAGGAGTGTGCCCCAAAATACAGTGGAAAGGTGCCTGAAGATATTTAAACCACG 

TCTTGGAAATTTAGTGGGTCTTGGCTTTGGGATAGGTGAAGTGAGGACAGACACTGGAGAGGA 

GGGAAAGGGGACGTTTTCAATAGGAGGCAAAACTCGAGGGTGGGATCCACTGAGGAGTACATA 

GGCTGCTGGATCTGGTGGAGCCAGCACTGGGCCCACGGGTGGTAACTGGCTGCTGTGGAGGGG 

GGTACGTGAGGGGGGGGTCTGGGGCTTATCCTCAGGTCCTGTGGGTGGGGCAGCGAGTCGGGG 

CCTGAGCGTCAAGAGCATGCCCTAGTGAGCGGGCTCCTCTGGGGGAGCCCAGCGCGCTCCGGG 

CGCCTGCCGGTTTGGGGGTGTCTCCTCCCGGGGCGCTATGGCGGCGCTGGCCAGTAGCCTGAT

CCGGCAGAAGCGGGAGGTCCGCGAGCCCGGGGGCAGCCGGCCGGTGTCGGCGCAGCGGCGCGT 

GTGTCCCCGCGGCACCAAGTCCCTTTGCCAGAAGCAGCTCCTCATCCTGCTGTCCAAGGTGCG 

ACTGTGCGGGGGGCGGCCCGCGCGGCCGGACCGCGGCCCGGAGCCTCAGCTCAAAGGCATCGT 

CACCAAACTGTTCTGCCGCCAGGGTTTCTACCTCCAGGCGAATCCCGACGGAAGCATCCAGGG 

CACCCCAGAGGATACCAGCTCCTTCACCCACTTCAACCTGATCCCTGTGGGCCTCCGTGTGGT 

CACCATCCAGAGCGCCAAGCTGGGTCACTACATGGCCATGAATGCTGAGGGACTGCTCTACAG 

TTCGCCGCATTTCACAGCTGAGTGTCGCTTTAAGGAGTGTGTCTTTGAGAATTACTACGTCCT

GTACGCCTCTGCTCTCTACCGCCAGCGTCGTTCTGGCCGGGCCTGGTACCTCGGCCTGGACAA 

GGAGGGCCAGGTCATGAAGGGAAACCGAGTTAAGAAGACCAAGGCAGCTGCCCACTTTCTGCC 

CAAGCTCCTGGAGGTGGCCATGTACCAGGAGCCTTCTCTCCACAGTGTCCCCGAGGCCTCCCC 

TTCCAGTCCCCCTGCCCCCTGAAATGTAGTCCCTGGACTGGAGGTTCCCTGCACTCCCAGTGA

GCCAGCCACCACCACAACCTGT 
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FIGURE 282 

MAALASSLIRQKREVREPGGSRPVSAQRRVCPRGTKSLCQKQLLILLSKVRLCGGRPARPDRG 
PEPQLKGIVTKLFCRQGFYLQANPDGSIQGTPEDTSSFTHFNLIPVGLRVVTIQSAKLGHYMA 
MNAEGLLYSSPHFTAECRFKECVFENYYVLYASALYRQRRSGRAWYLGLDKEGQVMKGNRVKK 

TKAAAHFLPKLLEVAMYQEPSLHSVPEASPSSPPAP 
Important features: 

Tyrosine kinase phosphorylation site: 

Amino acids 199-207 

N-myristoylation sites: 

Amino acids 54-60; 89-95; 131-137 

HBGF/FGF family signature: 

Amino acids 131-155 
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FIGURE 283 

ATGGCCGCGGCCATCGCTAGCGGCTTGATCCGCCAGAAGCGGCAGGCGCGGGAGCAGCACTGG 
GACCGGCCGTCTGCCAGCAGGAGGCGGAGCAGCCCCAGCAAGAACCGCGGGCTCTGCAACGGC 
AACCTGGTGGATATCTTCTCCAAAGTGCGCATCTTCGGCCTCAAGAAGCGCAGGTTGCGGCGC 
CAAGATCCCCAGCTCAAGGGTATAGTGACCAGGTTATATTGCAGGCAAGGCTACTACTTGCAA 
ATGCACCCCGATGGAGCTCTCGATGGAACCAAGGATGACAGCACTAATTCTACACTCTTCAAC 
CTCATACCAGTGGGACTACGTGTTGTTGCCATCCAGGGAGTGAAAACAGGGTTGTATATAGCC 
ATGAATGGAGAAGGTTACCTCTACCCATCAGAACTTTTTACCCCTGAATGCAAGTTTAAAGAA 
TCTGTTTTTGAAAATTATTATGTAATCTACTCATCCATGTTGTACAGACAACAGGAATCTGGT 
AGAGCCTGGTTTTTGGGATTAAATAAGGAAGGGCAAGCTATGAAAGGGAACAGAGTAAAGAAA 
ACCAAACCAGCAGCTCATTTTCTACCCAAGCCATTGGAAGTTGCCATGTACCGAGAACCATCT 
TTGCATGATGTTGGGGAAACGGTCCCGAAGCCTGGGGTGACGCCAAGTAAAAGCACAAGTGCG 
TCTGCAATAATGAATGGAGGCAAACCAGTCAACAAGAGTAAGACAACATAG
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FIGURE 284 

MAAAIASGLIRQKRQAREQHWDRPSASRRRSSPSKNRGLCNGNLVDIFSKVRIFGLKKRRLRR 
QDPQLKGIVTRLYCRQGYYLQMHPDGALDGTKDDSTNSTLFNLIPVGLRVVAIQGVKTGLYIA
MNGEGYLYPSELFTPECKFKESVFENYYVIYSSMLYRQQESGRAWFLGLNKEGQAMKGNRVKK 
TKPAAHFLPKPLEVAMYREPSLHDVGETVPKPGVTPSKSTSASAIMNGGKPVNKSKTT 

Important features: 
N-glycosylation sites: 

Amino acids 100-104, 242-246 

cAMP- and cGMP-dependent protein kinase phosphorylation sites: 

Amino acids 28-32, 29-33 

Tyrosine kinase phosphorylation site: 

Amino acids 199-207 

N-myristoylation sites: 

Amino acids 38-44, 89-95, 118-124, 122-128, 222-228 

HBGF/FGF family proteins : 

Amino acids 104-155, 171-198 
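
The correspondence between a nucleotide figure and its polypeptide figure can be spot-checked by translating the open reading frame. The following is a minimal sketch, not part of the original disclosure, assuming the standard genetic code and a reading frame that begins at the first base of FIGURE 283; the names `CODON_TABLE`, `translate`, and `orf_start` are illustrative only.

```python
# Standard genetic code, laid out in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the coding region
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 54 nucleotides of FIGURE 283 (hypothetical variable name).
orf_start = "ATGGCCGCGGCCATCGCTAGCGGCTTGATCCGCCAGAAGCGGCAGGCGCGGGAG"
print(translate(orf_start))  # MAAAIASGLIRQKRQARE
```

The printed residues agree with the first 18 amino acids of FIGURE 284. The same check over a full figure requires removing line breaks and any non-base characters first.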
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FIGURE 285 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGTTCAGGTCCAGGTTTTGCTTTGA 

TCCTTTTCAAAAACTGGAGACACAGAAGAGGGCTCTAGGAAAAAGTTTTGGATGGGATTATGTGGAAACTACCCT 

GCGATTCTCTGCTGCCAGAGCAGGCTCGGCGCTTCCACCCCAGTGCAGCCTTCCCCTGGCGGTGGTGAAAGAGAC 

TCGGGAGTCGCTGCTTCCAAAGTGCCCGCCGTGAGTGAGCTCTCACCCCAGTCAGCCAAATGAGCCTCTTCGGGC 

TTCTCCTGCTGACATCTGCCCTGGCCGGCCAGAGACAGGGGACTCAGGCGGAATCCAACCTGAGTAGTAAATTCC 

AGTTTTCCAGCAACAAGGAACAGAACGGAGTACAAGATCCTCAGCATGAGAGAATTATTACTGTGTCTACTAATG 

GAAGTATTCACAGCCCAAGGTTTCCTCATACTTATCCAAGAAATACGGTCTTGGTATGGAGATTAGTAGCAGTAG 

AGGAAAATGTATGGATACAACTTACGTTTGATGAAAGATTTGGGCTTGAAGACCCAGAAGATGACATATGCAAGT 

ATGATTTTGTAGAAGTTGAGGAACCCAGTGATGGAACTATATTAGGGCGCTGGTGTGGTTCTGGTACTGTACCAG 

GAAAACAGATTTCTAAAGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAACCAGGGT 

TCTGCATCCACTACAACATTGTCATGCCACKATTCACAGAAGCTGTGAGTCCTTCAGTGCTACCCCCTTCAGCTT 

TGCCACTGGACCTGCTTAATAATGCTATAACTGCCTTTAGTACCTTGGAAGACCTTATTCGATATCTTGAACCAG 

AGAGATGGCAGTTGGACTTAGAAGATCTATATAGGCCAACTTGGCAACTTCTTGGCAAGGCTTTTGTTTTTGGAA 

GAAAATCCAGAGTGGTGGATCTGAACCTTCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCT 

CAGTGTCCATAAGGGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTGTG 

GTGGGAACTGTGCCTGTTGTCTCCACAATTGCAATGAATGTCAATGTGTCCCAAGCAAAGTTACTAAAAAATACC 

ACGAGGTCCTTCAGTTGAGACCAAAGACCGGTGTCAGGGGATTGCACAAATCACTCACCGACGTGGCCCTGGAGC 

ACCATGAGGAGTGTGACTGTGTGTGCAGAGGGAGCACAGGAGGATAGCCGCATCACCACCAGCAGCTCTTGCCCA 

GAGCTGTGCAGTGCAGTGGCTGATTCTATTAGAGAACGTATGCGTTATCTCCATCCTTAATCTCAGTTGTTTGCT 

TCAAGGACCTTTCATCTTCAGGATTTACAGTGCATTCTGAAAGAGGAGACATCAAACAGAATTAGGAGTTGTGCA 

ACAGCTCTTTTGAGAGGAGGCCTAAAGGACAGGAGAAAAGGTCTTCAATCGTGGAAAGAAAATTAAATGTTGTAT 

TAAATAGATCACCAGCTAGTTTCAGAGTTACCATGTACGTATTCCACTAGCTGGGTTCTGTATTTCAGTTCTTTC 

GATACGGCTTAGGGTAATGTCAGTACAGGAAAAAAACTGTGCAAGTGAGCACCTGATTCCGTTGCCTTGCTTAAC 

TCTAAAGCTCCATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTTTTTTTTTTTGCTCATATTCACAT 

ATGTAAACCAGAACATTCTATGTACTACAAACCTGGTTTTTAAAAAGGAACTATGTTGCTATGAATTAAACTTGT 

GTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAATTTCTGCCATTTAGAAGAAGAGAACTACA 

TTCATGGTTTGGAAGAGATAAACCTGAAAAGAAGAGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTG 

TTTCATTGTGTACATTTTTATATTCTCCTTTTGACATTATAACTGTTGGCTTTTCTAATCTTGTTAAATATATCT 

ATTTTTACCAAAGGTATTTAATATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATTTTTCT 

AAACACAATTGTTATAGCCAGAGGAACAAAGATGATATAAAATATTGTTGCTCTGACAAAAATACATGTATTTCA 

TTCTCGTATGGTGCTAGAGTTAGATTAATCTGCATTTTAAAAAACTGAATTGGAATAGAATTGGTAAGTTGCAAA 

GACTTTTTGAAAATAATTAAATTATCATATCTTCCATTCCTGTTATTGGAGATGAAAATAAAAAGCAACTTATGA 

AAGTAGACATTCAGATCCAGCCATTACTAACCTATTCCTTTTTTGGGGAAATCTGAGCCTAGCTCAGAAAAACAT 

AAAGCACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACATCCTATTTA 

TTGTGATGTTGTGGTTTTATTATCTTAAACTCTGTTCCATACACTTGTATAAATACATGGATATTTTTATGTACA

GAAGTATGTCTCTTAACCAGTTCACTTATTGTACTCTGGCAATTTAAAAGAAAATCAGTAAAATATTTTGCTTGT 

AAAATGCTTAATATNGTGCCTAGGTTATGTGGTGACTATTTGAATCAAAAATGTATTGAATCATCAAATAAAAGA 

ATGTGGCTATTTTGGGGAGAAAATTAAAAAAAAAAAAAAAAAAAAAGGTTTAGGGATAACAGGGTAATGCGGCC 
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FIGURE 286 

MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERIITVSTNGSIHSPRF 
PHTYPRNTVLVWRLVAVEENVWIQLTFDERFGLEDPEDDICKYDFVEVEEPSDGTILGRWCGS 
GTVPGKQISKGNQIRIRFVSDEYFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLLNNA 
ITAFSTLEDLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRVVDLNLLTEEVRLYSCTP
RNFSVSIREELKRTDTIFWPGCLLVKRCGGNCACCLHNCNECQCVPSKVTKKYHEVLQLRPKT 
GVRGLHKSLTDVALEHHEECDCVCRGSTGG 

Important features: 
signal sequence: 

Amino acids 1-14 

N-glycosylation sites: 

Amino acids 25-29; 55-59; 254-258 

N-myristoylation sites: 

Amino acids 15-21; 117-123; 127-133; 281-287; 282-288; 319-325



Amidation site: 

Amino acids 229-233 
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FIGURE 287 

CAGCGCTGACTGCGCCGCGGAGAAAGCCAGTGGGAACCCAGACCCATAGGAGACCCGCGTCCC 

CGCTCGGCCTGGCCAGGCCCCGCGCTATGGAGTTCCTCTGGGCCCCTCTCTTGGGTCTGTGCT 

GCAGTCTGGCCGCTGCTGATCGCCACACCGTCTTCTGGAACAGTTCAAATCCCAAGTTCCGGA 

ATGAGGACTACACCATACATGTGCAGCTGAATGACTACGTGGACATCATCTGTCCGCACTATG 

AAGATCACTCTGTGGCAGACGCTGCCATGGAGCAGTACATACTGTACCTGGTGGAGCATGAGG 

AGTACCAGCTGTGCCAGCCCCAGTCCAAGGACCAAGTCCGCTGGCAGTGCAACCGGCCCAGTG 

CCAAGCATGGCCCGGAGAAGCTGTCTGAGAAGTTCCAGCGCTTCACACCTTTCACCCTGGGCA 

AGGAGTTCAAAGAAGGACACAGCTACTACTACATCTCCAAACCCATCCACCAGCATGAAGACC 

GCTGCTTGAGGTTGAAGGTGACTGTCAGTGGCAAAATCACTCACAGTCCTCAGGCCCATGACA 

ATCCACAGGAGAAGAGACTTGCAGCAGATGACCCAGAGGTGCGGGTTCTACATAGCATCGGTC 

ACAGTGCTGCCCCACGCCTCTTCCCACTTGCCTGGACTGTGCTGCTCCTTCCACTTCTGCTGC 

TGCAAACCCCGTGAAGGTGTGTGCCACACCTGGCCTTAAAGAGGGACAGGCTGAAGAGAGGGA 

CAGGCACTCCAAACCTGTCTTGGGGCCACTTTCAGAGCCCCCAGCCCTGGGAACCACTCCCAC 

CACAGGCATAAGCTATCACCTAGCAGCCTCAAAACGGGTCAATATTAAGGTTTTCAACCGGAA 

GGAGGCCAACCAGCCCGACAGTGCCATCCCCACCTTCACCTCGGAGGGATGGAGAAAGAAGTG 

GAGACAGTCCTTTCCCACCATTCCTGCCTTTAAGCCAAAGAAACAAGCTGTGCAGGCATGGTC 

CCTTAAGGCACAGTGGGAGCTGAGCTGGAAGGGGCCACGTGGATGGGCAAAGCTTGTCAAAGA 

TGCCCCCTTCAGGAGAGAGCCAGGATGCCCAGATGAACTGACTGAAGGAAAAGCAAGAAACAG 

TTTCTTGCTTGGAAGCCAGGTACAGGAGAGGCAGCATGCTTGGGCTGACCCAGCATCTCCCAG 

CAAGACCTCATCTGTGGAGCTGCCACAGAGAAGTTTGTAGCCAGGTACTGCATTCTCTCCCAT 

CCTGGGGCAGCACTCCCCAGAGCTGTGCCAGCAGGGGGGCTGTGCCAACCTGTTCTTAGAGTG 

TAGCTGTAAGGGCAGTGCCCATGTGTACATTCTGCCTAGAGTGTAGCCTAAAGGGCAGGGCCC 

ACGTGTATAGTATCTGTATATAAGTTGCTGTGTGTCTGTCCTGATTTCTACAACTGGAGTTTT 

TTTATACAATGTTCTTTGTCTCAAAATAAAGCAATGTGTTTTTTCGG 
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FIGURE 288 

MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIHVQLNDYVDIICPHYEDHSADAAM 
EQYILYLVEHEEYQLCQPQSKDQVRWQCNRPSAKHGPEKLSEKFQRFTPFTLGKEFKEGHSYY 
YISKPIHQHEDRCLRLKVTVSGKITHSPQAHDNPQEKRLAADDPEVRVLHSIGHSAAPRLFPL 

AWTVLLLPLLLLQTP 

Important features: 
Signal sequence: 

Amino acids 1-17 

N-glycosylation site: 

Amino acids 26-30 

Tyrosine kinase phosphorylation site: 

Amino acids 118-127 

N-myristoylation site: 

Amino acids 10-16 
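
Feature positions of this kind can be re-derived by scanning the polypeptide for a sequence motif. The sketch below is not part of the original disclosure. It assumes the N-glycosylation sites were located with the PROSITE-style pattern N-{P}-[ST]-{P}, which the source does not state, and it reports 1-based ranges with end = start + 4 to match the listing convention used in these figures. The names `N_GLYC`, `n_glyc_sites`, and `prefix` are illustrative.

```python
import re

# Assumed pattern: Asn, any residue except Pro, Ser or Thr, any residue except Pro.
# The lookahead lets overlapping matches be reported.
N_GLYC = re.compile(r"(?=N[^P][ST][^P])")


def n_glyc_sites(protein):
    """Return 1-based (start, end) pairs with end = start + 4."""
    return [(m.start() + 1, m.start() + 5) for m in N_GLYC.finditer(protein)]


# First 40 residues of FIGURE 288 (hypothetical variable name).
prefix = "MEFLWAPLLGLCCSLAAADRHTVFWNSSNPKFRNEDYTIH"
sites = n_glyc_sites(prefix)
print(sites)  # [(26, 30)]
```

The single hit agrees with the N-glycosylation site listed for FIGURE 288 at amino acids 26-30.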
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FIGURE 289

CGGACGCGTGGGCGGACGCGTGGGCGGCCCACGGCGCCCGCGGGCTGGGGCGGTCGCTTCTTC 

CTTCTCCGTGGCCTACGAGGGTCCCCAGCCTGGGTAAAGATGGCCCCATGGCCCCCGAAGGGC 

CTAGTCCCAGCTGTGCTCTGGGGCCTCAGCCTCTTCCTCAACCTCCCAGGACCTATCTGGCTC 

CAGCCCTCTCCACCTCCCCAGTCTTCTCCCCCGCCTCAGCCCCATCCGTGTCATACCTGCCGG 

GGACTGGTTGACAGCTTTAACAAGGGCCTGGAGAGAACCATCCGGGACAACTTTGGAGGTGGA 

AACACTGCCTGGGAGGAAGAGAATTTGTCCAAATACAAAGACAGTGAGACCCGCCTGGTAGAG 

GTGCTGGAGGGTGTGTGCAGCAAGTCAGACTTCGAGTGCCACCGCCTGCTGGAGCTGAGTGAG 

GAGCTGGTGGAGAGCTGGTGGTTTCACAAGCAGCAGGAGGCCCCGGACCTCTTCCAGTGGCTG 

TGCTCAGATTCCCTGAAGCTCTGCTGCCCCGCAGGCACCTTCGGGCCCTCCTGCCTTCCCTGT 

CCTGGGGGAACAGAGAGGCCCTGCGGTGGCTACGGGCAGTGTGAAGGAGAAGGGACACGAGGG 

GGCAGCGGGCACTGTGACTGCCAAGCCGGCTACGGGGGTGAGGCCTGTGGCCAGTGTGGCCTT 

GGCTACTTTGAGGCAGAACGCAACGCCAGCCATCTGGTATGTTCGGCTTGTTTTGGCCCCTGT 

GCCCGATGCTCAGGACCTGAGGAATCAAACTGTTTGCAATGCAAGAAGGGCTGGGCCCTGCAT 

CACCTCAAGTGTGTAGACATTGATGAGTGTGGCACAGAGGGAGCCAACTGTGGAGCTGACCAA 

TTCTGCGTGAACACTGAGGGCTCCTATGAGTGCCGAGACTGTGCCAAGGCCTGCCTAGGCTGC 

ATGGGGGCAGGGCCAGGTCGCTGTAAGAAGTGTAGCCCTGGCTATCAGCAGGTGGGCTCCAAG 

TGTCTCGATGTGGATGAGTGTGAGACAGAGGTGTGTCCGGGAGAGAACAAGCAGTGTGAAAAC 

ACCGAGGGCGGTTATCGCTGCATCTGTGCCGAGGGCTACAAGCAGATGGAAGGCATCTGTGTG 

AAGGAGCAGATCCCAGAGTCAGCAGGCTTCTTCTCAGAGATGACAGAAGACGAGTTGGTGGTG 

CTGCAGCAGATGTTCTTTGGCATCATCATCTGTGCACTGGCCACGCTGGCTGCTAAGGGCGAC 

TTGGTGTTCACCGCCATCTTCATTGGGGCTGTGGCGGCCATGACTGGCTACTGGTTGTCAGAG 

CGCAGTGACCGTGTGCTGGAGGGCTTCATCAAGGGCAGATAATCGCGGCCACCACCTGTAGGA 

CCTCCTCCCACCCACGCTGCCCCCAGAGCTTGGGCTGCCCTCCTGCTGGACACTCAGGACAGC 

TTGGTTTATTTTTGAGAGTGGGGTAAGCACCCGTACCTGCCTTACAGAGCAGCCCAGGTACCC 

AGGCCCGGGCAGACAAGGCCCCTGGGGTAAAAAGTAGCCCTGAAGGTGGATACCATGAGCTCT 

TCACCTGGCGGGGACTGGCAGGCTTCACAATGTGTGAATTTCAAAAGTTTTTCCTTAATGGTG 

GCTGCTAGAGCTTTGGCCCCTGCTTAGGATTAGGTGGTCCTCACAGGGGTGGGGCCATCACAG 

CTCCCTCCTGCCAGCTGCATGCTGCCAGTTCCTGTTCTGTGTTCACCACATCCCCACACCCCA 

TTGCCACTTATTTATTCATCTCAGGAAATAAAGAAAGGTCTTGGAAAGTTAAAAAAAAAAAAA 

AAAAAAAAAAA 



WO 01/40466 



PCT/US00/32678 



290/550 

FIGURE 290 

MAPWPPKGLVPAVLWGLSLFLNLPGPIWLQPSPPPQSSPPPQPHPCHTCRGLVDSFNKGLERT 
IRDNFGGGNTAWEEENLSKYKDSETRLVEVLEGVCSKSDFECHRLLELSEELVESWWFHKQQE 
APDLFQWLCSDSLKLCCPAGTFGPSCLPCPGGTERPCGGYGQCEGEGTRGGSGHCDCQAGYGG 
EACGQCGLGYFEAERNASHLVCSACFGPCARCSGPEESNCLQCKKGWALHHLKCVDIDECGTE 
GANCGADQFCVNTEGSYECRDCAKACLGCMGAGPGRCKKCSPGYQQVGSKCLDVDECETEVCP 
GENKQCENTEGGYRCICAEGYKQMEGICVKEQIPESAGFFSEMTEDELVVLQQMFFGIIICAL
ATLAAKGDLVFTAIFIGAVAAMTGYWLSERSDRVLEGFIKGR 

Important features: 
Signal sequence: 

Amino acids 1-29 

Transmembrane domain: 

Amino acids 342-392 

N-glycosylation sites: 

Amino acids 79-83;205-209 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

Amino acids 290-294 

Aspartic acid and asparagine hydroxylation site: 

Amino acids 321-333 

EGF-like domain cysteine pattern signature: 

Amino acids 181-193 



FIGURE 291

CAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCTCGACCTCGACCCAC 

GCGTCCGAACACAGGTCCTTGTTGCTGCAGAGAAGCAGTTGTTTTGCTGGAAGGAGGGAGTGCGCGGGCTGCCCC 

GGGCTCCTCCCTGCCGCCTCCTCTCAGTGGATGGTTCCAGGCACCCTGTCTGGGGCAGGGAGGGCACAGGCCTGC 

ACATCGAAGGTGGGGTGGGACCAGGCTGCCCCTCGCCCCAGCATCCAAGTCCTCCCTTGGGCGCCCGTGGCCCTG 

CAGACTCTCAGGGCTAAGGTCCTCTGTTGCTTTTTGGTTCCACCTTAGAAGAGGCTCCGCTTGACTAAGAGTAGC 

TTGAAGGAGGCACCATGCAGGAGCTGCATCTGCTCTGGTGGGCGCTTCTCCTGGGCCTGGCTCAGGCCTGCCCTG 

AGCCCTGCGACTGTGGGGAAAAGTATGGCTTCCAGATCGCCGACTGTGCCTACCGCGACCTAGAATCCGTGCCGC 

CTGGCTTCCCGGCCAATGTGACTACACTGAGCCTGTCAGCCAACCGGCTGCCAGGCTTGCCGGAGGGTGCCTTCA 

GGGAGGTGCCCCTGCTGCAGTCGCTGTGGCTGGCACACAATGAGATCCGCACGGTGGCCGCCGGAGCCCTGGCCT 

CTCTGAGCCATCTCAAGAGCCTGGACCTCAGCCACAATCTCATCTCTGACTTTGCCTGGAGCGACCTGCACAACC

TCAGTGCCCTCCAATTGCTCAAGATGGACAGCAACGAGCTGACCTTCATCCCCCGCGACGCCTTCCGCAGCCTCC 

GTGCTCTGCGCTCGCTGCAACTCAACCACAACCGCTTGCACACATTGGCCGAGGGCACCTTCACCCCGCTCACCG 

CGCTGTCCCACCTGCAGATCAACGAGAACCCCTTCGACTGCACCTGCGGCATCGTGTGGCTCAAGACATGGGCCC 

TGACCACGGCCGTGTCCATCCCGGAGCAGGACAACATCGCCTGCACCTCACCCCATGTGCTCAAGGGTACACCGC 

TGAGCCGCCTGCCGCCACTGCCATGCTCGGCGCCCTCAGTGCAGCTCAGCTACCAACCCAGCCAGGATGGTGCCG 

AGCTGCGGCCTGGTTTTGTGCTGGCACTGCACTGTGATGTGGACGGGCAGCCGGCCCCTCAGCTTCACTGGCACA 

TCCAGATACCCAGTGGCATTGTGGAGATCACCAGCCCCAACGTGGGCACTGATGGGCGTGCCCTGCCTGGCACCC 

CTGTGGCCAGCTCCCAGCCGCGCTTCCAGGCCTTTGCCAATGGCAGCCTGCTTATCCCCGACTTTGGCAAGCTGG 

AGGAAGGCACCTACAGCTGCCTGGCCACCAATGAGCTGGGCAGTGCTGAGAGCTCAGTGGACGTGGCACTGGCCA 

CGCCCGGTGAGGGTGGTGAGGACACACTGGGGCGCAGGTTCCATGGCAAAGCGGTTGAGGGAAAGGGCTGCTATA 

CGGTTGACAACGAGGTGCAGCCATCAGGGCCGGAGGACAATGTGGTCATCATCTACCTCAGCCGTGCTGGGAACC 

CTGAGGCTGCAGTCGCAGAAGGGGTCCCTGGGCAGCTGCCCCCAGGCCTGCTCCTGCTGGGCCAAAGCCTCCTCC 

TCTTCTTCTTCCTCACCTCCTTCTAGCCCCACCCAGGGCTTCCCTAACTCCTCCCCTTGCCCCTACCAATGCCCC 

TTTAAGTGCTGCAGGGGTCTGGGGTTGGCAACTCCTGAGGCCTGCATGGGTGACTTCACATTTTCCTACCTCTCC 

TTCTAATCTCTTCTAGAGCACCTGCTATCCCCAACTTCTAGACCTGCTCCAAACTAGTGACTAGGATAGAATTTG 

ATCCCCTAACTCACTGTCTGCGGTGCTCATTGCTGCTAACAGCATTGCCTGTGCTCTCCTCTCAGGGGCAGCATG 

CTAACGGGGCGACGTCCTAATCCAACTGGGAGAAGCCTCAGTGGTGGAATTCCAGGCACTGTGACTGTCAAGCTG 

GCAAGGGCCAGGATTGGGGGAATGGAGCTGGGGCTTAGCTGGGAGGTGGTCTGAAGCAGACAGGGAATGGGAGAG 

GAGGATGGGAAGTAGACAGTGGCTGGTATGGCTCTGAGGCTCCCTGGGGCCTGCTCAAGCTCCTCCTGCTCCTTG 

CTGTTTTCTGATGATTTGGGGGCTTGGGAGTCCCTTTGTCCTCATCTGAGACTGAAATGTGGGGATCCAGGATGG 

CCTTCCTTCCTCTTACCCTTCCTCCCTCAGCCTGCAACCTCTATCCTGGAACCTGTCCTCCCTTTCTCCCCAACT 

ATGCATCTGTTGTCTGCTCCTCTGCAAAGGCCAGCCAGCTTGGGAGCAGCAGAGAAATAAACAGCATTTCTGATG 

CCAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCGACCT 
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FIGURE 292 

MQELHLLWWALLLGLAQACPEPCDCGEKYGFQIADCAYRDLESVPPGFPANVTTLSLSANRLP 
GLPEGAFREVPLLQSLWLAHNEIRTVAAGALASLSHLKSLDLSHNLISDFAWSDLHNLSALQL 
LKMDSNELTFIPRDAFRSLRALRSLQLNHNRLHTLAEGTFTPLTALSHLQINENPFDCTCGIV 
WLKTWALTTAVSIPEQDNIACTSPHVLKGTPLSRLPPLPCSAPSVQLSYQPSQDGAELRPGFV 
LALHCDVDGQPAPQLHWHIQIPSGIVEITSPNVGTDGRALPGTPVASSQPRFQAFANGSLLIP 
DFGKLEEGTYSCLATNELGSAESSVDVALATPGEGGEDTLGRRFHGKAVEGKGCYTVDNEVQP 
SGPEDNVVIIYLSRAGNPEAAVAEGVPGQLPPGLLLLGQSLLLFFFLTSF

Important features : 

Signal peptide: 

amino acids 1-18 

Transmembrane domain : 

amino acids 403-418 

N-glycosylation sites: 

Amino acids 51-55,120-124,309-313 

Tyrosine kinase phosphorylation site: 

amino acids 319-326 

N-myristoylation sites:

amino acids 14-20, 64-70, 92-98, 218-224, 294-300, 323-329, 334-340,
350-356, 394-400

Amidation site: 

amino acids 355-359 

Leucine Rich Repeat: 

amino acids 51-74,75-98, 99-122,123-146,147-170 

Leucine rich repeat C-terminal domain:

amino acids 180-230 
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FIGURE 293 

Srggggcagacacgctcggacccacccgcagacggcccttctgga 

TGGTTTTCATCATTGACAGCTCTCGCAGTGTCAACACC 

TCTTGCAA^TCTTGGACATTGGTCCTGATGTCACCCGAGTGGGCCTGCTCCAATATGGCAGC^ 

agSSc^ScaIgaccttcaagaggaagtccgagg 

rPArCATGACT^GCTGGCCATCCAGTATGCCCTGAACATCGCATTCTCAGAAGCAGA 

gggagSgcScgggtcS 

a?^MGGACACGG€CATCCTAATCTTO 

SSSgagSSmSggacca^ 
a?SaagSctgcacggccc^atgtgcagcaccctg 

PTrrr at ggaggaccac^ctgtgagcagctctgtgtgaatgtgccgg 

rrTACGCCCTGGCTGAGGATGGGAA 

atgaSgtgtaaatgctgatggctcctacctttgccagtgccatgaagg 

cgtgScaIggatcaactactgtgc^^ 
aotacJgc^gSgccaccgtgg^ 

RCCAGGACCATGGCTGTGAGCAGCT^ 

^ScScgaggaccSaagac^ 

G^TCAACATGGACAGATCCTTTGCCTGTCAGTGTCC 

SaaaotSac??ttgtgcS 

^rr^TrrTTTrAArGTTATATACTCCGTGAAGATGGAAAAACCTGCAGAAGGAAA 
ACCATGGCTGTGAACACATTTGT^ 

gagaagScS 
aS^ccgaotgggccagtaaag^ 

ArGAGGAACTACAAGAGATTGCCTCTGAGCCCACAAACAAGCA^ 

aISaStaaSgaS^ 

rCTGTTCT^TTTTGCAGTGC^CACAGATATCTGTTTGAAGA^ 

ccca"caa^aaa1cc??SggaaS 

JrCAGAAC^iTGCA^CGAAGAAGTAAGAAAATTAA^ 
TrrAAAATCGCCTGAGATACAGA 

r n n rr^rT CC AGAGCCCC A^GCTCAGGCT^ TGTT AAAT C AAT AATGTT GT GAAGT AAAAC AATC AGT ACT GA 

aSStgStctgcctcatcctS 
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FIGURE 294 

MEKMLAGCFLLILGQIVLLPAEARERSRGRSISRGRHARTHPQTALLESSCENKRADLVFIID 
SSRSVNTHDYAKVKEFIVDILQFLDIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKR 
MRHLSTGTMTGLAIQYALNIAFSEAEGARPLRENVPRVIMIVTDGRPQDSVAEVAAKARDTGI 
LIFAIGVGQVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQKKLCTAHMCSTLEHNCAHF 
CINIPGSYVCRCKQGYILNSDQTTCRIQDLCAMEDHNCEQLCVNVPGSFVCQCYSGYALAEDG 
KRCVAVDYCASENHGCEHECVNADGSYLCQCHEGFALNPDEKTCTRINYCALNKPGCEHECVN 
MEESYYCRCHRGYTLDPNGKTCSRVDHCAQQDHGCEQLCLNTEDSFVCQCSEGFLINEDLKTC 
SRVDYCLLSDHGCEYSCVNMDRSFACQCPEGHVLRSDGKTCAKLDSCALGDHGCEHSCVSSED 
SFVCQCFEGYILREDGKTCRRKDVCQAIDHGCEHICVNSDDSYTCECLEGFRLAEDGKRCRRK 
DVCKSTHHGCEHICVNNGNSYICKCSEGFVLAEDGRRCKKCTEGPIDLVFVIDGSKSLGEENF 

EVVKQFVTGIIDSLTISPKAARVGLLQYSTQVHTEFTLRNFNSAKDMKKAVAHMKYMGKGSMT
GLALKHMFERSFTQGEGARPLSTRVPRAAIVFTDGRAQDDVSEWASKAKANGITMYAVGVGKA 
IEEELQEIASEPTNKHLFYAEDFSTMDEISEKLKKGICEALEDSDGRQDSPAGELPKTVQQPT 
ESEPVTINIQDLLSCSNFAVQHRYLFEEDNLLRSTQKLSHSTKPSGSPLEEKHDQCKCENLIM 

FQNLANEEVRKLTQRLEEMTQRMEALENRLRYR 

Important features: 
Signal sequence: 

Amino acids 1-23 

N-glycosylation site: 

Amino acids 221-225 

cAMP- and cGMP-dependent protein kinase phosphorylation sites: 

Amino acids 115-119; 606-610; 892-896 

N-myristoylation sites:

Amino acids 133-139;258-264;299-305; 340-346; 453-459; 494-500; 

639-645;690-694; 
752-758;792-798 

Amidation sites: 

Amino acids 314-318; 560-564; 601-605 

Aspartic acid and asparagine hydroxylation sites: 

Amino acids 253-265; 294-306; 335-347 ; 376-388 ; 417-429; 
458-470; 540-552; 581-593 




FIGURE 295 

GGCCGGAGCAGCACGGCCGCAGGACCTGGAGCTCCGGCTGCGTCTTCCCGCAGCGCTACCCGC 
CATGCGCCTGCCGCGCCGGGCCGCGCTGGGGCTCCTGCCGCTTCTGCTGCTGCTGCCGCCCGC 
GCCGGAGGCCGCCAAGAAGCCGACGCCCTGCCACCGGTGCCGGGGGCTGGTGGACAAGTTTAA 
CCAGGGGATGGTGGACACCGCAAAGAAGAACTTTGGCGGCGGGAACACGGCTTGGGAGGAAAA 
GACGCTGTCCAAGTACGAGTCCAGCGAGATTCGCCTGCTGGAGATCCTGGAGGGGCTGTGCGA 
GAGCAGCGACTTCGAATGCAATCAGATGCTAGAGGCGCAGGAGGAGCACCTGGAGGCCTGGTG 
GCTGCAGCTGAAGAGCGAATATCCTGACTTATTCGAGTGGTTTTGTGTGAAGACACTGAAAGT 
GTGCTGCTCTCCAGGAACCTACGGTCCCGACTGTCTCGCATGCCAGGGCGGATCCCAGAGGCC 
CTGCAGCGGGAATGGCCACTGCAGCGGAGATGGGAGCAGACAGGGCGACGGGTCCTGCCGGTG 
CCACATGGGGTACCAGGGCCCGCTGTGCACTGACTGCATGGACGGCTACTTCAGCTCGCTCCG 
GAACGAGACCCACAGCATCTGCACAGCCTGTGACGAGTCCTGCAAGACGTGCTCGGGCCTGAC 
CAACAGAGACTGCGGCGAGTGTGAAGTGGGCTGGGTGCTGGACGAGGGCGCCTGTGTGGATGT 
GGACGAGTGTGCGGCCGAGCCGCCTCCCTGCAGCGCTGCGCAGTTCTGTAAGAACGCCAACGG 
CTCCTACACGTGCGAAGAGTGTGACTCCAGCTGTGTGGGCTGCACAGGGGAAGGCCCAGGAAA 
CTGTAAAGAGTGTATCTCTGGCTACGCGAGGGAGCACGGACAGTGTGCAGATGTGGACGAGTG 
CTCACTAGCAGAAAAAACCTGTGTGAGGAAAAACGAAAACTGCTACAATACTCCAGGGAGCTA 
CGTCTGTGTGTGTCCTGACGGCTTCGAAGAAACGGAAGATGCCTGTGTGCCGCCGGCAGAGGC 
TGAAGCCACAGAAGGAGAAAGCCCGACACAGCTGCCCTCCCGCGAAGACCTGTAATGTGCCGG 
ACTTACCCTTTAAATTATTCAGAAGGATGTCCCGTGGAAAATGTGGCCCTGAGGATGCCGTCT 
CCTGCAGTGGACAGCGGCGGGGAGAGGCTGCCTGCTCTCTAACGGTTGATTCTCATTTGTCCC 
TTAAACAGCTGCATTTCTTGGTTGTTCTTAAACAGACTTGTATATTTTGATACAGTTCTTTGT 
AATAAAATTGACCATTGTAGGTAATCAGGAGGAAAAAAAAA 
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FIGURE 296 

MRLPRRAALGLLPLLLLLPPAPEAAKKPTPCHRCRGLVDKFNQGMVDTAKKNFGGGNTAWEEK 
TLSKYESSEIRLLEILEGLCESSDFECNQMLEAQEEHLEAWWLQLKSEYPDLFEWFCVKTLKV 
CCSPGTYGPDCLACQGGSQRPCSGNGHCSGDGSRQGDGSCRCHMGYQGPLCTDCMDGYFSSLR 
NETHSICTACDESCKTCSGLTNRDCGECEVGWVLDEGACVDVDECAAEPPPCSAAQFCKNANG 
SYTCEECDSSCVGCTGEGPGNCKECISGYAREHGQCADVDECSLAEKTCVRKNENCYNTPGSY 

VCVCPDGFEETEDACVPPAEAEATEGESPTQLPSREDL 

Important features: 

Signal peptide: 

Amino acids 1-24 

N-glycosylation sites: 

Amino acids 190-194 ; 251-255 

Glycosaminoglycan attachment sites: 

Amino acids 149-153; 155-159 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

Amino acids 26-30 

Tyrosine kinase phosphorylation site: 

Amino acids 303-310 

N-myristoylation sites:

Amino acids 44-50; 54-60; 55-61; 81-87; 150-156; 158-164; 164-170;
252-258; 313-319

Aspartic acid and asparagine hydroxylation site:

Amino acids 308-320 

EGF-like domain cysteine pattern signature: 

Amino acids 166-178 



Leucine zipper pattern: 

Amino acids 94-116 
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FIGURE 297 

GACATCGGAGGTGGGCTAGCACTGAAACTGCTTTTCAAGACGAGGAAGAGGAGGAGAAAGAGAAAGAAGAGGAAG

ATGTTGGGCAACATTTATTTAACATGCTCCACAGCCCGGACCCTGGCATCATGCTGCTATTCCTGCAAATACTGA 

AGAAGCATGGGATTTAAATATTTTACTTCTAAATAAATGAATTACTCAATCTCCTATGACCATCTATACATACTC 

CACCTTCAAAAAGTACATCAATATTATATCATTAAGGAAATAGTAACCTTCTCTTCTCCAATATGCATGACATTT 

TTGGACAATGCAATTGTGGCACTGGCACTTATTTCAGTGAAGAAAAACTTTGTGGTTCTATGGCATTCATCATTT 

GACAAATGCAAGCATCTTCCTTATCAATCAGCTCCTATTGAACTTACTAGCACTGACTGTGGAATCCTTAAGGGC 

CCATTACATTTCTGAAGAAGAAAGCTAAGATGAAGGACATGCCACTCCGAATTCATGTGCTACTTGGCCTAGCTA 

TCACTACACTAGTACAAGCTGTAGATAAAAAAGTGGATTGTCCACGGTTATGTACGTGTGAAATCAGGCCTTGGT 

TTACACCCAGATCCATTTATATGGAAGCATCTACAGTGGATTGTAATGATTTAGGTCTTTTAACTTTCCCAGCCA 

GATTGCCAGCTAACACACAGATTCTTCTCCTACAGACTAACAATATTGCAAAAATTGAATACTCCACAGACTTTC 

CAGTAAACCTTACTGGCCTGGATTTATCTCAAAACAATTTATCTTCAGTCACCAATATTAATGTAAAAAAGATGC 

CTCAGCTCCTTTCTGTGTACCTAGAGGAAAACAAACTTACTGAACTGCCTGAAAAATGTCTGTCCGAACTGAGCA 

ACTTACAAGAACTCTATATTAATCACAACTTGCTTTCTACAATTTCACCTGGAGCCTTTATTGGCCTACATAATC 

TTCTTCGACTTCATCTCAATTCAAATAGATTGCAGATGATCAACAGTAAGTGGTTTGATGCTCTTCCAAATCTAG 

AGATTCTGATGATTGGGGAAAATCCAATTATCAGAATCAAAGACATGAACTTTAAGCCTCTTATCAATCTTCGCA 

GCCTGGTTATAGCTGGTATAAACCTCACAGAAATACCAGATAACGCCTTGGTTGGACTGGAAAACTTAGAAAGCA 

TCTCTTTTTACGATAACAGGCTTATTAAAGTACCCCATGTTGCTCTTCAAAAAGTTGTAAATCTCAAATTTTTGG 

ATCTAAATAAAAATCCTATTAATAGAATACGAAGGGGTGATTTTAGCAATATGCTACACTTAAAAGAGTTGGGGA 

TAAATAATATGCCTGAGCTGATTTCCATCGATAGTCTTGCTGTGGATAACCTGCCAGATTTAAGAAAAATAGAAG 

CTACTAACAACCCTAGATTGTCTTACATTCACCCCAATGCATTTTTCAGACTCCCCAAGCTGGAATCACTCATGC 

TGAACAGCAATGCTCTCAGTGCCCTGTACCATGGTACCATTGAGTCTCTGCCAAACCTCAAGGAAATCAGCATAC 

ACAGTAACCCCATCAGGTGTGACTGTGTCATCCGTTGGATGAACATGAACAAAACCAACATTCGATTCATGGAGC 

CAGATTCACTGTTTTGCGTGGACCCACCTGAATTCCAAGGTCAGAATGTTCGGCAAGTGCATTTCAGGGACATGA 

TGGAAATTTGTCTCCCTCTTATAGCTCCTGAGAGCTTTCCTTCTAATCTAAATGTAGAAGCTGGGAGCTATGTTT 

CCTTTCACTGTAGAGCTACTGCAGAACCACAGCCTGAAATCTACTGGATAACACCTTCTGGTCAAAAACTCTTGC 

CTAATACCCTGACAGACAAGTTCTATGTCCATTCTGAGGGAACACTAGATATAAATGGCGTAACTCCCAAAGAAG 

GGGGTTTATATACTTGTATAGCAACTAACCTAGTTGGCGCTGACTTGAAGTCTGTTATGATCAAAGTGGATGGAT 

CTTTTCCACAAGATAACAATGGCTCTTTGAATATTAAAATAAGAGATATTCAGGCCAATTCAGTTTTGGTGTCCT 

GGAAAGCAAGTTCTAAAATTCTCAAATCTAGTGTTAAATGGACAGCCTTTGTCAAGACTGAAAATTCTCATGCTG 

CGCAAAGTGCTCGAATACCATCTGATGTCAAGGTATATAATCTTACTCATCTGAATCCATCAACTGAGTATAAAA 

TTTGTATTGATATTCCCACCATCTATCAGAAAAACAGAAAAAAATGTGTAAATGTCACCACCAAAGGTTTGCACC

CTGATCAAAAAGAGTATGAAAAGAATAATACCACAACACTTATGGCCTGTCTTGGAGGCCTTCTGGGGATTATTG 

GTGTGATATGTCTTATCAGCTGCCTCTCTCCAGAAATGAACTGTGATGGTGGACACAGCTATGTGAGGAATTACT 

TACAGAAACCAACCTTTGCATTAGGTGAGCTTTATCCTCCTCTGATAAATCTCTGGGAAGCAGGAAAAGAAAAAA 

GTACATCACTGAAAGTAAAAGCAACTGTTATAGGTTTACCAACAAATATGTCCTAAAAACCACCAAGGAAACCTA 

CTCCAAAAATGAAC
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FIGURE 298 

MKDMPLRIHVLLGLAITTLVQAVDKKVDCPRLCTCEIRPWFTPRSIYMEASTVDCNDLGLLTF 
PARLPANTQILLLQTNNIAKIEYSTDFPVNLTGLDLSQNNLSSVTNINVKKMPQLLSVYLEEN 
KLTELPEKCLSELSNLQELYINHNLLSTISPGAFIGLHNLLRLHLNSNRLQMINSKWFDALPN 
LEILMIGENPIIRIKDMNFKPLINLRSLVIAGINLTEIPDNALVGLENLESISFYDNRLIKVP 
HVALQKVVNLKFLDLNKNPINRIRRGDFSNMLHLKELGINNMPELISIDSLAVDNLPDLRKIE 
ATNNPRLSYIHPNAFFRLPKLESLMLNSNALSALYHGTIESLPNLKEISIHSNPIRCDCVIRW 
MNMNKTNIRFMEPDSLFCVDPPEFQGQNVRQVHFRDMMEICLPLIAPESFPSNLNVEAGSYVS 
FHCRATAEPQPEIYWITPSGQKLLPNTLTDKFYVHSEGTLDINGVTPKEGGLYTCIATNLVGA 
DLKSVMIKVDGSFPQDNNGSLNIKIRDIQANSVLVSWKASSKILKSSVKWTAFVKTENSHAAQ 
SARIPSDVKVYNLTHLNPSTEYKICIDIPTIYQKNRKKCVNVTTKGLHPDQKEYEKNNTTTLM 
ACLGGLLGIIGVICLISCLSPEMNCDGGHSYVRNYLQKPTFALGELYPPLINLWEAGKEKSTS 

LKVKATVIGLPTNMS 

Important features: 

Signal sequence: 

amino acids 1-22 

Transmembrane domain : 

amino acids 633-650 

N-glycosylation site.

amino acids 93-97, 103-107, 223-227, 382-386, 522-526, 579-583,
608-612, 624-628, 625-629

Casein kinase II phosphorylation site. 

amino acids 51-55, 95-99, 242-246, 468-472, 487-491 

Tyrosine kinase phosphorylation site. 

amino acids 570-579

N-myristoylation site.

amino acids 13-19, 96-102, 158-164, 221-227, 352-358, 437-443, 
491-497, 492-498, 634-640, 702-708 



Cell attachment sequence. 

amino acids 277-280 
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FIGURE 299 

GCTGTGGGAACCTCTCCACGCGCACGAACTCAGCCAACGATTTCTGATAGATTTTTGGGAGTT 
TGACCAGAGATGCAAGGGGTGAAGGAGCGCTTCCTACCGTTAGGGAACTCTGGGGACAGAGCG 
CCCCGGCCGCCTGATGGCCGAGGCAGGGTGCGACCCAGGACCCAGGACGGCGTCGGGAACCAT 
ACCATGGCCCGGATCCCCAAGACCCTAAAGTTCGTCGTCGTCATCGTCGCGGTCCTGCTGCCA 
GTCCTAGCTTACTCTGCCACCACTGCCCGGCAGGAGGAAGTTCCCCAGCAGACAGTGGCCCCA 
CAGCAACAGAGGCACAGCTTCAAGGGGGAGGAGTGTCCAGCAGGATCTCATAGATCAGAACAT 
ACTGGAGCCTGTAACCCGTGCACAGAGGGTGTGGATTACACCAACGCTTCCAACAATGAACCT 
TCTTGCTTCCCATGTACAGTTTGTAAATCAGATCAAAAACATAAAAGTTCCTGCACCATGACC 
AGAGACACAGTGTGTCAGTGTAAAGAAGGCACCTTCCGGAATGAAAACTCCCCAGAGATGTGC 
CGGAAGTGTAGCAGGTGCCCTAGTGGGGAAGTCCAAGTCAGTAATTGTACGTCCTGGGATGAT 
ATCCAGTGTGTTGAAGAATTTGGTGCCAATGCCACTGTGGAAACCCCAGCTGCTGAAGAGACA 
ATGAACACCAGCCCGGGGACTCCTGCCCCAGCTGCTGAAGAGACAATGAACACCAGCCCAGGG 
ACTCCTGCCCCAGCTGCTGAAGAGACAATGACCACCAGCCCGGGGACTCCTGCCCCAGCTGCT 
GAAGAGACAATGACCACCAGCCCGGGGACTCCTGCCCCAGCTGCTGAAGAGACAATGACCACC 
AGCCCGGGGACTCCTGCCTCTTCTCATTACCTCTCATGCACCATCGTAGGGATCATAGTTCTA 
ATTGTGCTTCTGATTGTGTTTGTTTGAAAGACTTCACTGTGGAAGAAATTCCTTCCTTACCTG 
AAAGGTTCAGGTAGGCGCTGGCTGAGGGCGGGGGGCGCTGGACACTCTCTGCCCTGCCTCCCT 
CTGCTGTGTTCCCACAGACAGAAACGCCTGC 
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

FIGURE 300 

MARIPKTLKFVVVIVAVLLPVLAYSATTARQEEVPQQTVAPQQQRHSFKGEECPAGSHRSEHT 
GACNPCTEGVDYTNASNNEPSCFPCTVCKSDQKHKSSCTMTRDTVCQCKEGTFRNENSPEMCR 
KCSRCPSGEVQVSNCTSWDDIQCVEEFGANATVETPAAEETMNTSPGTPAPAAEETMNTSPGT 
PAPAAEETMTTSPGTPAPAAEETMTTSPGTPAPAAEETMTTSPGTPASSHYLSCTIVGIIVLI 
VLLIVFV 

Important features: 
Signal peptide: 

Amino acids 1-29 

Transmembrane domain: 

Amino acids 240-259 

N-glycosylation site: 

Amino acids 77-81;140-144; 156-160 

cAMP- and cGMP-dependent protein kinase phosphorylation site:

Amino acids 126-130 

N-myristoylation sites: 

Amino acids 56-62; 72-78; 114-120; 154-160; 233-239
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FIGURE 301

CACAAGCATCTTAATTTGAATCCACAAAGTTTCATGTAATGAAAAGAAATACATAATTTTAAT 

TCAACCCGAGTGTTTTCCAAGAAGATTGTATTTGCTTAAATTGCTACAGTAATTCAAGAGACA 

GCCCTGTCTGGACACAGAGTTACTGTGGATTTTTAAGAGACTCAGTTAAAGAATTTAGGAATT 

TCTGATTCATTTAAAGGATTTACAAATTCATCAACCCCTGAAAACTAAAGCAAATTGAACAGG 

AAAAAAAAAAAGAAGATGGGTTTTTTAAGTCCAATATATGTTATTTTCTTCTTTTTTGGAGTC

AAAGTACATTGCCAATATGAAACTTATCAGTGGGATGAAGACTATGACCAAGAGCCAGATGAT 

GATTACCAAACAGGATTCCCATTTCGTCAAAATGTAGACTACGGAGTTCCTTTTCATCAGTAT 

ACTTTAGGCTGTGTCAGTGAATGCTTCTGTCCAACTAACTTTCCATCATCAATGTACTGTGAT 

AATCGCAAACTCAAGACTATCCCAAATATTCCGATGCACATTCAGCAACTCTACCTTCAGTTC 

AATGAAATTGAGGCTGTGACTGCAAATTCATTCATCAATGCAACTCATCTTAAAGAAATTAAC 

CTCAGCCACAACAAAATTAAATCTCAAAAGATTGATTATGGTGTGTTTGCTAAGCTTCCAAAT 

CTACTACAACTTCATCTAGAGCATAATAATTTAGAAGAATTTCCATTTCCTCTTCCTAAATCT 

CTGGAAAGACTCCTTCTTGGTTACAATGAAATCTCCAAACTGCAGACAAATGCTATGGATGGG 

CTAGTAAACTTGACCATGCTTGATCTCTGTTATAATTATCTTCATGATTCTCTGCTAAAAGAC 

AAAATCTTTGCCAAAATGGAAAAACTAATGCAGCTCAACCTCTGCAGTAACAGATTAGAATCA 

ATGCCTCCTGGTTTGCCTTCTTCACTTATGTATCTGTCTTTAGAAAATAATTCAATTTCTTCT 

ATACCCGAAAAATACTTCGACAAACTTCCAAAACTTCATACTCTAAGAATGTCACACAACAAA 

CTACAAGACATCCCATATAATATTTTTAATCTTCCCAACATTGTAGAACTCAGTGTTGGACAC 

AACAAATTGAAGCAAGCATTCTATATTCCAAGAAATTTGGAACACCTATACCTACAAAATAAT 

GAAATAGAAAAGATGAATCTTACAGTGATGTGTCCTTCTATTGACCCACTACATTACCACCAT 

TTAACATACATTCGTGTGGACCAAAATAAACTAAAAGAACCAATAAGCTCATACATCTTCTTC 

TGCTTCCCTCATATACACACTATTTATTATGGTGAACAACGAAGCACTAATGGTCAAACAATA 

CAACTAAAGACACAAGTTTTCAGGAGATTTCCAGATGATGATGATGAAAGTGAAGATCACGAT 

GATCCTGACAATGCTCATGAGAGCCCAGAACAAGAAGGAGCAGAAGGGCACTTTGACCTTCAT 

TATTATGAAAATCAAGAATAGCAAGAAACTATATAGGTATACACTTACGACTTCACAAAACCTA 

TACTTAATATAGTAAATCTAAGTAAACATGTATTACTCAAAGTAATATATTTAGAATTATGTA 

TTAGTATAAGATCAGAATTGAATTTAAGTTGTTGGTGACATCTGCATCATTTCATAGGATTAG 

AACTTACTCAAAATAATGTAAATCTTTAAAAATATAAATTAGAATGACAAGTGGGAATCATAA 

ATTAAACGTTAATGGTTTCTTATGCTCTTTTTAAATATAGAAATATCATGTTAAAGAAAAAAA 



AAAAAAA 
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

FIGURE 302 

MGFLSPIYVIFFFFGVKVHCQYETYQWDEDYDQEPDDDYQTGFPFRQNVDYGVPFHQYTLGCV 
SECFCPTNFPSSMYCDNRKLKTIPNIPMHIQQLYLQFNEIEAVTANSFINATHLKEINLSHNK 
IKSQKIDYGVFAKLPNLLQLHLEHNNLEEFPFPLPKSLERLLLGYNEISKLQTNAMDGLVNLT 
MLDLCYNYLHDSLLKDKIFAKMEKLMQLNLCSNRLESMPPGLPSSLMYLSLENNSISSIPEKY 
FDKLPKLHTLRMSHNKLQDIPYNIFNLPNIVELSVGHNKLKQAFYIPRNLEHLYLQNNEIEKM 
NLTVMCPSIDPLHYHHLTYIRVDQNKLKEPISSYIFFCFPHIHTIYYGEQRSTNGQTIQLKTQ 
VFRRFPDDDDESEDHDDPDNAHESPEQEGAEGHFDLHYYENQE 

Important features:
N-glycosylation sites:

Amino acids 113-117 ; 121-125; 187-191; 242-246; 316-320 

Tyrosine kinase phosphorylation sites: 

Amino acids 268-275; 300-307 

N-myristoylation site: 

Amino acids 230-236 



Leucine zipper patterns: 

Amino acids 146-168 ; 217-239 
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FIGURE 303

GCCCGGGACTGGCGCAAGGTGCCCAAGCAAGGAAAGAAATAATGAAGAGACACATGTGTTAGC

TGCAGCCTTTTGAAACACGCAAGAAGGAAATCAATAGTGTGGACAGGGCTGGAACCTTTACCA 

CGCTTGTTGGAGTAGATGAGGAATGGGCTCGTGATTATGCTGACATTCCAGCATGAATCTGGT 

AGACCTGTGGTTAACCCGTTCCCTCTCCATGTGTCTCCTCCTACAAAGTTTTGTTCTTATGAT 

ACTGTGCTTTCATTCTGCCAGTATGTGTCCCAAGGGCTGTCTTTGTTCTTCCTCTGGGGGTTT 

AAATGTCACCTGTAGCAATGCAAATCTCAAGGAAATACCTAGAGATCTTCCTCCTGAAACAGT 

CTTACTGTATCTGGACTCCAATCAGATCACATCTATTCCCAATGAAATTTTTAAGGACCTCCA 

TCAACTGAGAGTTCTCAACCTGTCCAAAAATGGCATTGAGTTTATCGATGAGCATGCCTTCAA 

AGGAGTAGCTGAAACCTTGCAGACTCTGGACTTGTCCGACAATCGGATTCAAAGTGTGCACAA 

AAATGCCTTCAATAACCTGAAGGCCAGGGCCAGAATTGCCAACAACCCCTGGCACTGCGACTG 

TACTCTACAGCAAGTTCTGAGGAGCATGGCGTCCAATCATGAGACAGCCCACAACGTGATCTG 

TAAAACGTCCGTGTTGGATGAACATGCTGGCAGACCATTCCTCAATGCTGCCAACGACGCTGA 

CCTTTGTAACCTCCCTAAAAAAACTACCGATTATGCCATGCTGGTCACCATGTTTGGCTGGTT 

CACTATGGTGATCTCATATGTGGTATATTATGTGAGGCAAAATCAGGAGGATGCCCGGAGACA 

CCTCGAATACTTGAAATCCCTGCCAAGCAGGCAGAAGAAAGCAGATGAACCTGATGATATTAG 

CACTGTGGTATAGTGTCCAAACTGACTGTCATTGAGAAAGAAAGAAAGTAGTTTGCGATTGCA

GTAGAAATAAGTGGTTTACTTCTCCCATCCATTGTAAACATTTGAAACTTTGTATTTCAGTTT 

TTTTTGAATTATGCCACTGCTGAACTTTTAACAAACACTACAACATAAATAATTTGAGTTTAG 

GTGATCCACCCCTTAATTGTACCCCCGATGGTATATTTCTGAGTAAGCTACTATCTGAACATT 

AGTTAGATCCATCTCACTATTTAATAATGAAATTTATTTTTTTAATTTAAAAGCAAATAAAAG 

CTTAACTTTGAACCATGGGAAAAAAAAAAAAAAAAAAAAAAAACA 
FIGURE 304

MNLVDLWLTRSLSMCLLLQSFVLMILCFHSASMCPKGCLCSSSGGLNVTCSNANLKEIPRDLP 
PETVLLYLDSNQITSIPNEIFKDLHQLRVLNLSKNGIEFIDEHAFKGVAETLQTLDLSDNRIQ 
SVHKNAFNNLKARARIANNPWHCDCTLQQVLRSMASNHETAHNVICKTSVLDEHAGRPFLNAA 
NDADLCNLPKKTTDYAMLVTMFGWFTMVISYVVYYVRQNQEDARRHLEYLKSLPSRQKKADEP
DDISTVV

Important features:
Signal sequence:

Amino acids 1-33

Transmembrane domain:

Amino acids 204-219

N-glycosylation sites: 

Amino acids 47-51; 94-98 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

Amino acids 199-203 

Casein kinase II phosphorylation sites:

Amino acids 162-166; 175-179

N-myristoylation sites:

Amino acids 37-43; 45-51; 110-116
FIGURE 305

S^gtaaaScSSgaataattttgttcaaagggata 

^Sca^Saca^ctcatgcacactgttgaacaagctactttaaggatatcccagagcttccaaaaga 

CCACAGAGTTTG^TACAAATTCT^ 

rTA(^CTATTTCTTGCT<^ 
TTGCCGGACTGCTACAGTACTTCTTTTTM 

Sg?5g?ggSS?ScScaaSaS 

EE 

CATCAAATTAT^AATTATTAACTACTAGACAAAAAGTAT^ 
rTAGATAATAAGGTAAAATTATGTATC^ 

a?JcgagSmatgaa^tcctgaaggaaaccac 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
FIGURE 306

MKRLPLLWFSTLLNCSYTQNCTKTPCLPNAKCEIRNGIEACYCNMGFSGNGVTICEDDNECGNLTQSCGENANC 
TNTEGSYYCMCVPGFRSSSNQDRFITNDGTVCIENVNANCHLDNVCIAANINKTLTKIRSIKEPVALLQEVYRNS 
VTDLSPTDIITYIEILAESSSLLGYKNNTISAKDTLSNSTLTEFVKTVNNFVQRDTFVVWDKLSVNHRRTHLTKL 

MHTVEQATLRISQSFQKTTEFDTNSTDIALKVFFFDSYNMKHIHPHMNMDGDYINIFPKRKAAYDSNGNVAVAFL
YYKSIGPLLSSSDNFLLKPQNYDNSEEEERVISSVISVSMSSNPPTLYELEKITFTLSHRKVTDRYRSLCAFWNY
SPDTMNGSWSSEGCELTYSNETHTSCRCNHLTHFAILMSSGPSIGIKDYNILTRITQLGIIISLICLAICIFTFW 
FFSEIQSTRTTIHKNLCCSLFLAELVFLVGINTNTNKLFCSIIAGLLHYFFLAAFAWMCIEGIHLYLIWGVIYN 
KGFLHKNFYIFGYLSPAVWGFSAALGYRYYGTTKVCWLSTENNFIWSFIGPACLIILVNLLAFGVIIYKVFRHT 
AGLKPEVSCFENIRSCARGALALLFLLGTTWIFGVLHWHASVVTAYLFTVSNAFQGMFIFLFLCVLSRKIQEEY
YRLFKNVPCCFGCLR

Important features: 
Signal peptide: 

Amino acids 1-19 

Transmembrane domain:

Amino acids 431-450; 494-515; 573-594; 619-636; 646-664

N-glycosylation sites: 

Amino acids 15-19; 21-25; 64-68; 74-78; 127-131; 177-181; 
188-192; 249-253; 381-385; 395-399 

Glycosaminoglycan attachment site: 

Amino acids 49-53

cAMP- and cGMP-dependent protein kinase phosphorylation site:

Amino acids 360-364 

Tyrosine kinase phosphorylation sites: 

Amino acids 36-44; 670-677

N-myristoylation sites:

Amino acids 38-44; 50-56; 52-58; 80-86; 382-388; 388-394;
434-440; 480-486; 521-527



Aspartic acid and asparagine hydroxylation 

Amino acids 75-87 

FIGURE 307 

CCAGGCCGGGAGGCGACGCGCCCAGCCGTCTAAACGGGAACAGCCCTGGCTGAGGGAGCTGCAGCGCAGCAGAGT 
ATCTGACGGCGCCAGGTTGCGTAGGTGCGGCACGAGGAGTTTTCCCGGCAGCGAGGAGGTCCTGAGCAGCATGGC
CCGGAGGAGCGCCTTCCCTGCCGCCGCGCTCTGGCTCTGGAGCATCCTCCTGTGCCTGCTGGCACTGCGGGCGGA 
GGCCGGGCCGCCGCAGGAGGAGAGCCTGTACCTATGGATCGATGCTCACCAGGCAAGAGTACTCATAGGATTTGA 
AGAAGATATCCTGATTGTTTCAGAGGGGAAAATGGCACCTTTTACACATGATTTCAGAAAAGCGCAACAGAGAAT 
GCCAGCTATTCCTGTCAATATCCATTCCATGAATTTTACCTGGCAAGCTGCAGGGCAGGCAGAATACTTCTATGA 
ATTCCTGTCCTTGCGCTCCCTGGATAAAGGCATCATGGCAGATCCAACCGTCAATGTCCCTCTGCTGGGAACAGT 
GCCTCACAAGGCATCAGTTGTTCAAGTTGGTTTCCCATGTCTTGGAAAACAGGATGGGGTGGCAGCATTTGAAGT 
GGATGTGATTGTTATGAATTCTGAAGGCAACACCATTCTCCAAACACCTCAAAATGCTATCTTCTTTAAAACATG 
TCAACAAGCTGAGTGCCCAGGCGGGTGCCGAAATGGAGGCTTTTGTAATGAAAGACGCATCTGCGAGTGTCCTGA
TGGGTTCCACGGACCTCACTGTGAGAAAGCCCTTTGTACCCCACGATGTATGAATGGTGGACTTTGTGTGACTCC 
TGGTTTCTGCATCTGCCCACCTGGATTCTATGGAGTGAACTGTGACAAAGCAAACTGCTCAACCACCTGCTTTAA 
TGGAGGGACCTGTTTCTACCCTGGAAAATGTATTTGCCCTCCAGGACTAGAGGGAGAGCAGTGTGAAATCAGCAA 
ATGCCCACAACCCTGTCGAAATGGAGGTAAATGCATTGGTAAAAGCAAATGTAAGTGTTCCAAAGGTTACCAGGG 
AGACCTCTGTTCAAAGCCTGTCTGCGAGCCTGGCTGTGGTGCACATGGAACCTGCCATGAACCCAACAAATGCCA 
ATGTCAAGAAGGTTGGCATGGAAGACACTGCAATAAAAGGTACGAAGCCAGCCTCATACATGCCCTGAGGCCAGC 
AGGCGCCCAGCTCAGGCAGCACACGCCTTCACTTAAAAAGGCCGAGGAGCGGCGGGATCCACCTGAATCCAATTA 
CATCTGGTGAACTCCGACATCTGAAACGTTTTAAGTTACACCAAGTTCATAGCCTTTGTTAACCTTTCATGTGTT
GAATGTTCAAATAATGTTCATTACACTTAAGAATACTGGCCTGAATTTTATTAGCTTCATTATAAATCACTGAGC 
TGATATTTACTCTTCCTTTTAAGTTTTCTAAGTACGTCTGTAGCATGATGGTATAGATTTTCTTGTTTCAGTGCT 
TTGGGACAGATTTTATATTATGTCAATTGATCAGGTTAAAATTTTCAGTGTGTAGTTGGCAGATATTTTCAAAAT 
TACAATGCATTTATGGTGTCTGGGGGCAGGGGAACATCAGAAAGGTTAAATTGGGCAAAAATGCGTAAGTCACAA 
GAATTTGGATGGTGCAGTTAATGTTGAAGTTACAGCATTTCAGATTTTATTGTCAGATATTTAGATGTTTGTTAC 
ATTTTTAAAAATTGCTCTTAATTTTTAAACTCTCAATACAATATATTTTGACCTTACCATTATTCCAGAGATTCA 
GTATTAAAAAAAAAAAAATTACACTGTGGTAGTGGCATTTAAACAATATAATATATTCTAAACACAATGAAATAG 
GGAATATAATGTATGAACTTTTTGCATTGGCTTGAAGCAATATAATATATTGTAAACAAAACACAGCTCTTACCT 
AATAAACATTTTATACTGTTTGTATGTATAAAATAAAGGTGCTGCTTTAGTTTTTTGGAAAAAAAAAAAAAAAAA 
AAAAAAAA 
FIGURE 308

MARRSAFPAAALWLWSILLCLLALRAEAGPPQEESLYLWIDAHQARVLIGFEEDILIVSEGKM 
APFTHDFRKAQQRMPAIPVNIHSMNFTWQAAGQAEYFYEFLSLRSLDKGIMADPTVNVPLLGT 
VPHKASVVQVGFPCLGKQDGVAAFEVDVIVMNSEGNTILQTPQNAIFFKTCQQAECPGGCRNG 
GFCNERRICECPDGFHGPHCEKALCTPRCMNGGLCVTPGFCICPPGFYGVNCDKANCSTTCFN 
GGTCFYPGKCICPPGLEGEQCEISKCPQPCRNGGKCIGKSKCKCSKGYQGDLCSKPVCEPGCG 
AHGTCHEPNKCQCQEGWHGRHCNKRYEASLIHALRPAGAQLRQHTPSLKKAEERRDPPESNYIW 

Important features: 
Signal sequence: 

Amino acids 1-28 

N-glycosylation sites: 

Amino acids 88-92; 245-249

Tyrosine kinase phosphorylation site: 

Amino acids 370-378 

N-myristoylation sites: 

Amino acids 184-190; 185-191; 189-195; 315-321 

ATP/GTP-binding site motif A (P-loop) : 

Amino acids 285-293 



EGF-like domain cysteine pattern signatures : 

Amino acids 198-210; 230-242; 262-274; 294-306; 326-338

FIGURE 309 



CCCACGCGTCCGGTCTCGCTCGCTCGCGCAGCGGCGGCAGCAGAGGTCGCGCACAGATGCGGG 

TTAGACTGGCGGGGGGAGGAGGCGGAGGAGGGAAGGAAGCTGCATGCATGAGACCCACAGACT 

CTTGCAAGCTGGATGCCCTCTGTGGATGAAAGATGTATCATGGAATGAACCCGAGCAATGGAG

ATGGATTTCTAGAGCAGCAGCAGCAGCAGCAGCAACCTCAGTCCCCCCAGAGACTCTTGGCCG 

TGATCCTGTGGTTTCAGCTGGCGCTGTGCTTCGGCCCTGCACAGCTCACGGGCGGGTTCGATG 

ACCTTCAAGTGTGTGCTGACCCCGGCATTCCCGAGAATGGCTTCAGGACCCCCAGCGGAGGGG 

TTTTCTTTGAAGGCTCTGTAGCCCGATTTCACTGCCAAGACGGATTCAAGCTGAAGGGCGCTA 

CAAAGAGACTGTGTTTGAAGCATTTTAATGGAACCCTAGGCTGGATCCCAAGTGATAATTCCA 

TCTGTGTGCAAGAAGATTGCCGTATCCCTCAAATCGAAGATGCTGAGATTCATAACAAGACAT 

ATAGACATGGAGAGAAGCTAATCATCACTTGTCATGAAGGATTCAAGATCCGGTACCCCGACC 

TACACAATATGGTTTCATTATGTCGCGATGATGGAACGTGGAATAATCTGCCCATCTGTCAAG 

GCTGCCTGAGACCTCTAGCCTCTTCTAATGGCTATGTAAACATCTCTGAGCTCCAGACCTCCT 

TCCCGGTGGGGACTGTGATCTCCTATCGCTGCTTTCCCGGATTTAAACTTGATGGGTCTGCGT 

ATCTTGAGTGCTTACAAAACCTTATCTGGTCGTCCAGCCCACCCCGGTGCCTTGCTCTGGAAG 

CCCAAGTCTGTCCACTACCTCCAATGGTGAGTCACGGAGATTTCGTCTGCCACCCGCGGCCTT 

GTGAGCGCTACAACCACGGAACTGTGGTGGAGTTTTACTGCGATCCTGGCTACAGCCTCACCA 

GCGACTACAAGTACATCACCTGCCAGTATGGAGAGTGGTTTCCTTCTTATCAAGTCTACTGCA 

TCAAATCAGAGCAAACGTGGCCCAGCACCCATGAGACCCTCCTGACCACGTGGAAGATTGTGG 

CGTTCACGGCAACCAGTGTGCTGCTGGTGCTGCTGCTCGTCATCCTGGCCAGGATGTTCCAGA 

CCAAGTTCAAGGCCCACTTTCCCCCCAGGGGGCCTCCCCGGAGTTCCAGCAGTGACCCTGACT 

TTGTGGTGGTAGACGGCGTGCCCGTCATGCTCCCGTCCTATGACGAAGCTGTGAGTGGCGGCT 

TGAGTGCCTTAGGCCCCGGGTACATGGCCTCTGTGGGCCAGGGCTGCCCCTTACCCGTGGACG 

ACCAGAGCCCCCCAGCATACCCCGGCTCAGGGGACACGGACACAGGCCCAGGGGAGTCAGAAA 

CCTGTGACAGCGTCTCAGGCTCTTCTGAGCTGCTCCAAAGTCTGTATTCACCTCCCAGGTGCC 

AAGAGAGCACCCACCCTGCTTCGGACAACCCTGACATAATTGCCAGCACGGCAGAGGAGGTGG 

CATCCACCAGCCCAGGCATCCATCATGCCCACTGGGTGTTGTTCCTAAGAAACS^TTGATTA

AAAAATTTCCCAAAGTGTCCTGAAGTGTCTCTTCAAATACATGTTGATCTGTGGAGTTGATTC 

CTTTCCTTCTCTTGGTTTTAGACAAATGTAAACAAAGCTCTGATCCTTAAAATTGCTATGCTG 

ATAGAGTGGTGAGGGCTGGAAGCTTGATCAAGTCCTGTTTCTTCTTGACACAGACTGATTAAA 

AATTAAAAGNAAAAAA 

FIGURE 310 

MYHGMNPSNGDGFLEQQQQQQQPQSPQRLLAVILWFQLALCFGPAQLTGGFDDLQVCADPGIP 
ENGFRTPSGGVFFEGSVARFHCQDGFKLKGATKRLCLKHFNGTLGWIPSDNSICVQEDCRIPQ 
IEDAEIHNKTYRHGEKLIITCHEGFKIRYPDLHNMVSLCRDDGTWNNLPICQGCLRPLASSNG 
YVNISELQTSFPVGTVISYRCFPGFKLDGSAYLECLQNLIWSSSPPRCLALEAQVCPLPPMVS 
HGDFVCHPRPCERYNHGTVVEFYCDPGYSLTSDYKYITCQYGEWFPSYQVYCIKSEQTWPSTH 
ETLLTTWKIVAFTATSVLLVLLLVILARMFQTKFKAHFPPRGPPRSSSSDPDFVVVDGVPVML 
PSYDEAVSGGLSALGPGYMASVGQGCPLPVDDQSPPAYPGSGDTDTGPGESETCDSVSGSSEL 
LQSLYSPPRCQESTHPASDNPDIIASTAEEVASTSPGIHHAHWVLFLRN 

Important features: 
Signal sequence: 

amino acids 1-41 

Transmembrane domain: 

amino acids 325-344 

N-glycosylation site.

amino acids 104-108, 134-138, 192-196

Casein kinase II phosphorylation site.

amino acids 8-12, 146-150, 252-256, 270-274, 313-317, 362-366, 
364-368, 380-384, 467-471, 468-472 

N-myristoylation site. 

amino acids 4-10, 61-67, 169-175, 203-209, 387-393, 418-424, 
478-484 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 394-405 
FIGURE 311

CAGCGCGTGGCCGGCGCCGCTGTGGGGACAGCATGAGCGGCGGTTGGATGGCGCAGGTTGGAG

CGTGGCGAACAGGGGCTCTGGGCCTGGCGCTGCTGCTGCTGCTCGGCCTCGGACTAGGCCTGG 

AGGCCGCCGCGAGCCCGCTTTCCACCCCGACCTCTGCCCAGGCCGCAGGCCCCAGCTCAGGCT 

CGTGCCCACCCACCAAGTTCCAGTGCCGCACCAGTGGCTTATGCGTGCCCCTCACCTGGCGCT 

GCGACAGGGACTTGGACTGCAGCGATGGCAGCGATGAGGAGGAGTGCAGGATTGAGCCATGTA 

CCCAGAAAGGGCAATGCCCACCGCCCCCTGGCCTCCCCTGCCCCTGCACCGGCGTCAGTGACT 

GCTCTGGGGGAACTGACAAGAAACTGCGCAACTGCAGCCGCCTGGCCTGCCTAGCAGGCGAGC 

TCCGTTGCACGCTGAGCGATGACTGCATTCCACTCACGTGGCGCTGCGACGGCCACCCAGACT 

GTCCCGACTCCAGCGACGAGCTCGGCTGTGGAACCAATGAGATCCTCCCGGAAGGGGATGCCA 

CAACCATGGGGCCCCCTGTGACCCTGGAGAGTGTCACCTCTCTCAGGAATGCCACAACCATGG 

GGCCCCCTGTGACCCTGGAGAGTGTCCCCTCTGTCGGGAATGCCACATCCTCCTCTGCCGGAG 

ACCAGTCTGGAAGCCCAACTGCCTATGGGGTTATTGCAGCTGCTGCGGTGCTCAGTGCAAGCC 

TGGTCACCGCCACCCTCCTCCTTTTGTCCTGGCTCCGAGCCCAGGAGCGCCTCCGCCCACTGG 

GGTTACTGGTGGCCATGAAGGAGTCCCTGCTGCTGTCAGAACAGAAGACCTCGCTGCCCTGAG

GACAAGCACTTGCCACCACCGTCACTCAGCCCTGGGCGTAGCCGGACAGGAGGAGAGCAGTGA 

TGCGGATGGGTACCCGGGCACACCAGCCCTCAGAGACCTGAGTTCTTCTGGCCACGTGGAACC 

TCGAACCCGAGCTCCTGCAGAAGTGGCCCTGGAGATTGAGGGTCCCTGGACACTCCCTATGGA 

GATCCGGGGAGCTAGGATGGGGAACCTGCCACAGCCAGAACTGAGGGGCTGGCCCCAGGCAGC 

TCCCAGGGGGTAGAACGGCCCTGTGCTTAAGACACTCCCTGCTGCCCCGTCTGAGGGTGGCGA 

TTAAAGTTGCTTC 
FIGURE 312

MSGGWMAQVGAWRTGALGLALLLLLGLGLGLEAAASPLSTPTSAQAAGPSSGSCPPTKFQCRT
SGLCVPLTWRCDRDLDCSDGSDEEECRIEPCTQKGQCPPPPGLPCPCTGVSDCSGGTDKKLRN 
CSRLACLAGELRCTLSDDCIPLTWRCDGHPDCPDSSDELGCGTNEILPEGDATTMGPPVTLES 
VTSLRNATTMGPPVTLESVPSVGNATSSSAGDQSGSPTAYGVIAAAAVLSASLVTATLLLLSW 
LRAQERLRPLGLLVAMKESLLLSEQKTSLP 

Important features: 
Signal sequence: 

Amino acids 1-30 

Transmembrane domain: 

Amino acids 231-248 

N-glycosylation sites: 

Amino acids 126-130; 195-199; 213-217 

Casein kinase II phosphorylation sites:

Amino acids 84-88; 140-144; 161-165; 218-222

N-myristoylation sites:

Amino acids 3-9; 10-16; 26-32; 30-36; 112-118; 166-172; 212-218;
224-230; 230-236; 263-269

Prokaryotic membrane lipoprotein lipid attachment site:

Amino acids 44-55



Leucine zipper pattern: 

Amino acids 17-39 
FIGURE 313

CGGACGCGTGGGCGTCCGGCGGTCGCAGAGCCAGGAGGCGGAGGCGCGCGGGCCAGCCTGGGCCCCAGCCCACAC 

CTTCACCAGGGCCCAGGAGCCACCATGTGGCGATGTCCACTGGGGCTACTGCTGTTGCTGCCGCTGGCTGGCCAC

TTGGCTCTGGGTGCCCAGCAGGGTCGTGGGCGCCGGGAGCTAGCACCGGGTCTGCACCTGCGGGGCATCCGGGAC 

GCGGGAGGCCGGTACTGCCAGGAGCAGGACCTGTGCTGCCGCGGCCGTGCCGACGACTGTGCCCTGCCCTACCTG 

GGCGCCATCTGTTACTGTGACCTCTTCTGCAACCGCACGGTCTCCGACTGCTGCCCTGACTTCTGGGACTTCTGC 

CTCGGCGTGCCACCCCCTTTTCCCCCGATCCAAGGATGTATGCATGGAGGTCGTATCTATCCAGTCTTGGGAACG 

TACTGGGACAACTGTAACCGTTGCACCTGCCAGGAGAACAGGCAGTGGCATGGTGGATCCAGACATGATCAAAGC 

CATCAACCAGGGCAACTATGGCTGGCAGGCTGGGAACCACAGCGCCTTCTGGGGCATGACCCTGGATGAGGGCAT 

TCGCTACCGCCTGGGCACCATCCGCCCATCTTCCTCGGTCATGAACATGCATGAAATTTATACAGTGCTGAACCC 

AGGGGAGGTGCTTCCCACAGCCTTCGAGGCCTCTGAGAAGTGGCCCAACCTGATTCATGAGCCTCTTGACCAAGG 

CAACTGTGCAGGCTCCTGGGCCTTCTCCACAGCAGCTGTGGCATCCGATCGTGTCTCAATCCATTCTCTGGGACA 

CATGACGCCTGTCCTGTCGCCCCAGAACCTGCTGTCTTGTGACACCCACCAGCAGCAGGGCTGCCGCGGTGGGCG 

TCTCGATGGTGCCTGGTGGTTCCTGCGTCGCCGAGGGGTGGTGTCTGACCACTGCTACCCCTTCTCGGGCCGTGA 

ACGAGACGAGGCTGGCCCTGCGCCCCCCTGTATGATGCACAGCCGAGCCATGGGTCGGGGCAAGCGCCAGGCCAC 

TGCCCACTGCCCCAACAGCTATGTTAATAACAATGACATCTACCAGGTCACTCCTGTCTACCGCCTCGGCTCCAA 

CGACAAGGAGATCATGAAGGAGCTGATGGAGAATGGCCCTGTCCAAGCCCTCATGGAGGTGCATGAGGACTTCTT 

CCTATACAAGGGAGGCATCTACAGCCACACGCCAGTGAGCCTTGGGAGGCCAGAGAGATACCGCCGGCATGGGAC 

CCACTCAGTCAAGATCACAGGATGGGGAGAGGAGACGCTGCCAGATGGAAGGACGCTCAAATACTGGACTGCGGC 

CAACTCCTGGGGCCCAGCCTGGGGCGAGAGGGGCCACTTCCGCATCGTGCGCGGCGTCAATGAGTGCGACATCGA 

GAGCTTCGTGCTGGGCGTCTGGGGCCGCGTGGGCATGGAGGACATGGGTCATCACTGAGGCTGCGGGCACCACGC 

GGGGTCCGGCCTGGGATCCAGGCTAAGGGCCGGCGGAAGAGGCCCCAATGGGGCGGTGACCCCAGCCTCGCCCGA 

CAGAGCCCGGGGCGCAGGCGGGCGCCAGGGCGCTAATCCCGGCGCGGGTTCCGCTGACGCAGCGCCCCGCCTGGG 

AGCCGCGGGCAGGCGAGACTGGCGGAGCCCCCAGACCTCCCAGTGGGGACGGGGCAGGGCCTGGCCTGGGAAGAG 

CACAGCTGCAGATCCCAGGCCTCTGGCGCCCCCACTCAAGACTACCAAAGCCAGGACACCTCAAGTCTCCAGCCC 

CAATACCCCACCCCAATCCCGTATTCTTTTTTTTTTTTTTTTAGACAGGGTCTTGCTCCGTTGCCCAGGTTGGAG 

TGCAGTGGCCCATCAGGGCTCACTGTAACCTCCGACTCCTGGGTTCAAGTGACCCTCCCACCTCAGCCTCTCAAG 

TAGCTGGGACTACAGGTGCACCACCACACCTGGCTAATTTTTGTATTTTTTGTAAAGAGGGGGGTCTCACTGTGT 

TGCCCAGGCTGGTTTCGAACTCCTGGGCTCAAGCGGTCCACCTGCCTCCGCCTCCCAAAGTGCTGGGATTGCAGG 

CATGAGCCACTGCACCCAGCCCTGTATTCTTATTCTTCAGATATTTATTTTTCTTTTCACTGTTTTAAAATAAAA 



CCAAAGTATTGATAAAAAAAAA 
FIGURE 314

MWRCPLGLLLLLPLAGHLALGAQQGRGRRELAPGLHLRGIRDAGGRYCQEQDLCCRGRADDCA 
LPYLGAICYCDLFCNRTVSDCCPDFWDFCLGVPPPFPPIQGCMHGGRIYPVLGTYWDNCNRCT
CQENRQWHGGSRHDQSHQPGQLWLAGWEPQRLLGHDPG

Important features: 
N-glycosylation site. 

amino acids 78-82, 161-165 

Casein kinase II phosphorylation site. 

amino acids 80-84, 117-121, 126-130, 169-173, 205-209, 296-300, 
411-415 

N-myristoylation site. 

amino acids 21-27, 39-45, 44-50, 104-110, 160-164, 224-230, 
269-275, 378-384, 442-448 

Amidation site.

amino acids 26-30, 318-322 



Eukaryotic thiol (cysteine) proteases histidine active site.

amino acids 398-409 
FIGURE 315

CGGACGCGTGGGCCCCTGGTGGGCCCAGCAAGATGGATCTACTGTGGATCCTGCCCTCCCTGT 
GGCTTCTCCTGCTTGGGGGGCCTGCCTGCCTGAAGACCCAGGAACACCCCAGCTGCCCAGGAC 
CCAGGGAACTGGAAGCCAGCAAAGTTGTCCTCCTGCCCAGTTGTCCCGGAGCTCCAGGAAGTC 
CTGGGGAGAAGGGAGCCCCAGGTCCTCAAGGGCCACCTGGACCACCAGGCAAGATGGGCCCCA 
AGGGTGAGCCAGGCCCCAGAAACTGCCGGGAGCTGTTGAGCCAGGGCGCCACCTTGAGCGGCT 
GGTACCATCTGTGCCTACCTGAGGGCAGGGCCCTCCCAGTCTTTTGTGACATGGACACCGAGG 
GGGGCGGCTGGCTGGTGTTTCAGAGGCGCCAGGATGGTTCTGTGGATTTCTTCCGCTCTTGGT 
CCTCCTACAGAGCAGGTTTTGGGAACCAAGAGTCTGAATTCTGGCTGGGAAATGAGAATTTGC 
ACCAGCTTACTCTCCAGGGTAACTGGGAGCTGCGGGTAGAGCTGGAAGACTTTAATGGTAACC 
GTACTTTCGCCCACTATGCGACCTTCCGCCTCCTCGGTGAGGTAGACCACTACCAGCTGGCAC 
TGGGCAAGTTCTCAGAGGGCACTGCAGGGGATTCCCTGAGCCTCCACAGTGGGAGGCCCTTTA 
CCACCTATGACGCTGACCACGATTCAAGCAACAGCAACTGTGCAGTGATTGTCCACGGTGCCT 
GGTGGTATGCATCCTGTTACCGATCAAATCTCAATGGTCGCTATGCAGTGTCTGAGGCTGCCG 
CCCACAAATATGGCATTGACTGGGCCTCAGGCCGTGGTGTGGGCCACCCCTACCGCAGGGTTC 
GGATGATGCTTCGATAGGGCACTCTGGCAGCCAGTGCCCTTATCTCTCCTGTACAGCTTCCGG 
ATCGTCAGCCACCTTGCCTTTGCCAACCACCTCTGCTTGCCTGTCCACATTTAAAAATAAAAT 
CATTTTAGCCCTTTCA 
FIGURE 316

MDLLWILPSLWLLLLGGPACLKTQEHPSCPGPRELEASKVVLLPSCPGAPGSPGEKGAPGPQG 
PPGPPGKMGPKGEPGPRNCRELLSQGATLSGWYHLCLPEGRALPVFCDMDTEGGGWLVFQRRQ 
DGSVDFFRSWSSYRAGFGNQESEFWLGNENLHQLTLQGNWELRVELEDFNGNRTFAHYATFRL 
LGEVDHYQLALGKFSEGTAGDSLSLHSGRPFTTYDADHDSSNSNCAVIVHGAWWYASCYRSNL
NGRYAVSEAAAHKYGIDWASGRGVGHPYRRVRMMLR

Important features: 
Signal peptide: 

Amino acids 1-16 

N-glycosylation site: 

Amino acids 178-182 

Glycosaminoglycan attachment site: 

Amino acids 272-276

Tyrosine kinase phosphorylation site: 

Amino acids 188-197 

N-myristoylation sites: 

Amino acids 16-22; 89-95; 144-150; 267-273

Fibrinogen beta and gamma chains C-terminal domain signature:

Amino acids 242-255
FIGURE 317

CCCAAGCCAGCCGAGCCGCCAGAGCCGCGGGCCGCGGGGGTGTCGCGGGCCCAACCCCAGGAT 
GCTCCCCTGCGCCTCCTGCCTACCCGGGTCTCTACTGCTCTGGGCGCTGCTACTGTTGCTCTT
GGGATCAGCTTCTCCTCAGGATTCTGAAGAGCCCGACAGCTACACGGAATGCACAGATGGCTA 
TGAGTGGGACCCAGACAGCCAGCACTGCCGGGATGTCAACGAGTGTCTGACCATCCCTGAGGC 
CTGCAAGGGGGAAATGAAGTGCATCAACCACTACGGGGGCTACTTGTGCCTGCCCCGCTCCGC 
TGCCGTCATCAACGACCTACATGGCGAGGGACCCCCGCCACCAGTGCCTCCCGCTCAACACCC 
CAACCCCTGCCCACCAGGCTATGAGCCCGACGATCAGGACAGCTGTGTGGATGTGGACGAGTG 
TGCCCAGGCCCTGCACGACTGTCGCCCCAGCCAGGACTGCCATAACTTGCCTGGCTCCTATCA 
GTGCACCTGCCCTGATGGTTACCGCAAGATCGGGCCCGAGTGTGTGGACATAGACGAGTGCCG 
CTACCGCTACTGCCAGCACCGCTGCGTGAACCTGCCTGGCTCCTTCCGCTGCCAGTGCGAGCC 
GGGCTTCCAGCTGGGGCCTAACAACCGCTCCTGTGTTGATGTGAACGAGTGTGACATGGGGGC 
CCCATGCGAGCAGCGCTGCTTCAACTCCTATGGGACCTTCCTGTGTCGCTGCCACCAGGGCTA 
TGAGCTGCATCGGGATGGCTTCTCCTGCAGTGATATTGATGAGTGTAGCTACTCCAGCTACCT 
CTGTCAGTACCGCTGCGTCAACGAGCCAGGCCGTTTCTCCTGCCACTGCCCACAGGGTTACCA 
GCTGCTGGCCACACGCCTCTGCCAAGACATTGATGAGTGTGAGTCTGGTGCGCACCAGTGCTC 
CGAGGCCCAAACCTGTGTCAACTTCCATGGGGGCTACCGCTGCGTGGACACCAACCGCTGCGT 
GGAGCCCTACATCCAGGTCTCTGAGAACCGCTGTCTCTGCCCGGCCTCCAACCCTCTATGTCG 
AGAGCAGCCTTCATCCATTGTGCACCGCTACATGACCATCACCTCGGAGCGGAGCGTGCCCGC 
TGACGTGTTCCAGATCCAGGCGACCTCCGTCTACCCCGGTGCCTACAATGCCTTTCAGATCCG 
TGCTGGAAACTCGCAGGGGGACTTTTACATTAGGCAAATCAACAACGTCAGCGCCATGCTGGT 
CCTCGCCCGGCCGGTGACGGGCCCCCGGGAGTACGTGCTGGACCTGGAGATGGTCACCATGAA 
TTCCCTCATGAGCTACCGGGCCAGCTCTGTACTGAGGCTCACCGTCTTTGTAGGGGCCTACAC 
CTTCTGAGGAGCAGGAGGGAGCCACCCTCCCTGCAGCTACCCTAGCTGAGGAGCCTGTTGTGA
GGGGCAGAATGAGAAAGGCAATAAAGGGAGAAAGAAAGTCCTGGTGGCTGAGGTGGGCGGGTC 
ACACTGCAGGAAGCCTCAGGCTGGGGCAGGGTGGCACTTGGGGGGGCAGGCCAAGTTCACCTA 
AATGGGGGTCTCTATATGTTCAGGCCCAGGGGCCCCCATTGACAGGAGCTGGGAGCTCTGCAC 
CACGAGCTTCAGTCACCCCGAGAGGAGAGGAGGTAACGAGGAGGGCGGACTCCAGGCCCCGGC 
CCAGAGATTTGGACTTGGCTGGCTTGCAGGGGTCCTAAGAAACTCCACTCTGGACAGCGCCAG 
GAGGCCCTGGGTTCCATTCCTAACTCTGCCTCAAACTGTACATTTGGATAAGCCCTAGTAGTT 
CCCTGGGCCTGTTTTTGTATAAAACGAGGCAACTGGAAAAAAAAAAAA 
FIGURE 318

MLPCASCLPGSLLLWALLLLLLGSASPQDSEEPDSYTECTDGYEWDPDSQHCRDVNECLTIPE 
ACKGEMKCINHYGGYLCLPRSAAVINDLHGEGPPPPVPPAQHPNPCPPGYEPDDQDSCVDVDE 
CAQALHDCRPSQDCHNLPGSYQCTCPDGYRKIGPECVDIDECRYRYCQHRCVNLPGSFRCQCE 
PGFQLGPNNRSCVDVNECDMGAPCEQRCFNSYGTFLCRCHQGYELHRDGFSCSDIDECSYSSY 
LCQYRCVNEPGRFSCHCPQGYQLLATRLCQDIDECESGAHQCSEAQTCVNFHGGYRCVDTNRC 
VEPYIQVSENRCLCPASNPLCREQPSSIVHRYMTITSERSVPADVFQIQATSVYPGAYNAFQI 
RAGNSQGDFYIRQINNVSAMLVLARPVTGPREYVLDLEMVTMNSLMSYRASSVLRLTVFVGAYTF 

Important features: 
Signal sequence: 

Amino acids 1-25 

N-glycosylation sites:

Amino acids 198-202; 394-398

N-myristoylation sites:

Amino acids 76-82; 145-151; 182-188; 222-228; 290-296; 305-311;
371-377; 381-387

Aspartic acid and asparagine hydroxylation sites: 

amino acids 140-152 ; 177-189; 217-229; 258-270 
FIGURE 319

GCTGGGGACATGAGAGGCACACCGAAGACCCACCTCCTGGCCTTCTCCCTCCTCTGCCTCCTC 
TCAAAGGTGCGTACCCAGCTGTGCCCGACACCATGTACCTGCCCCTGGCCACCTCCCCGATGC 
CCGCTGGGAGTACCCCTGGTGCTGGATGGCTGTGGCTGCTGCCGGGTATGTGCACGGCGGCTG 
GGGGAGCCCTGCGACCAACTCCACGTCTGCGACGCCAGCCAGGGCCTGGTCTGCCAGCCCGGG 
GCAGGACCCGGTGGCCGGGGGGCCCTGTGCCTCTTGGCAGAGGACGACAGCAGCTGTGAGGTG 
AACGGCCGCCTGTATCGGGAAGGGGAGACCTTCCAGCCCCACTGCAGCATCCGCTGCCGCTGC 
GAGGACGGCGGCTTCACCTGCGTGCCGCTGTGCAGCGAGGATGTGCGGCTGCCCAGCTGGGAC 
TGCCCCCACCCCAGGAGGGTCGAGGTCCTGGGCAAGTGCTGCCCTGAGTGGGTGTGCGGCCAA 
GGAGGGGGACTGGGGACCCAGCCCCTTCCAGCCCAAGGACCCCAGTTTTCTGGCCTTGTCTCT 
TCCCTGCCCCCTGGTGTCCCCTGCCCAGAATGGAGCACGGCCTGGGGACCCTGCTCGACCACC 
TGTGGGCTGGGCATGGCCACCCGGGTGTCCAACCAGAACCGCTTCTGCCGACTGGAGACCCAG 
CGCCGCCTGTGCCTGTCCAGGCCCTGCCCACCCTCCAGGGGTCGCAGTCCACAAAACAGTGCC 
TTCTAGAGCGGGGCTGGGAATGGGGACACGGTGTCCACCATCCCCAGCTGGTGGCCCTGTGCC 
TGGGCCCTGGGCTGATGGAAGATGGTCCGTGCCCAGGCCCTTGGCTGCAGGCAACACTTTAGC 
TTGGGTCCACCATGCAGAACACCAATATTAACACGCTGCCTGGTCTGTCTGGATCCCGAGGTA 
TGGCAGAGGTGCAAGACCTAGTCCCCTTTCCTCTAACTCACTGCCTAGGAGGCTGGCCAAGGT 
GTCCAGGGTCCTCTAGCCCACTCCCTGCCTACACACACAGCCTATATCAAACATGCACACGGG 
CGAGCTTTCTCTCCGACTTCCCCTGGGCAAGAGATGGGACAAGCAGTCCCTTAATATTGAGGC 
TGCAGCAGGTGCTGGGCTGGACTGGCCATTTTTCTGGGGGTAGGATGAAGAGAAGGCACACAG 
AGATTCTGGATCTCCTGCTGCCTTTTCTGGAGTTTGTAAAATTGTTCCTGAATACAAGCCTAT 

GCGTGA 
FIGURE 320

MRGTPKTHLLAFSLLCLLSKVRTQLCPTPCTCPWPPPRCPLGVPLVLDGCGCCRVCARRLGEP 
CDQLHVCDASQGLVCQPGAGPGGRGALCLLAEDDSSCEVNGRLYREGETFQPHCSIRCRCEDG 
GFTCVPLCSEDVRLPSWDCPHPRRVEVLGKCCPEWVCGQGGGLGTQPLPAQGPQFSGLVSSLP 
PGVPCPEWSTAWGPCSTTCGLGMATRVSNQNRFCRLETQRRLCLSRPCPPSRGRSPQNSAF 

Important features: 

Signal sequence: 
Amino acids 1-23 

N-myristoylation sites: 

Amino acids 3-9; 49-55; 81-87; 85-91; 126-132; 164-170; 166-172;
167-173; 183-189; 209-215

Insulin-like growth factor binding proteins signature:

Amino acids 49-65

von Willebrand C1 domain:

Amino acids 107-124 

Thrombospondin 1 Homology Block: 

Amino acids 201-216 

IGF binding protein site: 

Amino acids 49-58 
FIGURE 321

AGAACCTCAGAAATGTGAGTTATTTGGGAATGGCTGTTTGTAAATGTCCTTACGTAAGCCAAG 
AGGAGGTCTTGACTTGGGGTCCCAGGGGTACCGCAGATCCCAGGGACTGGAGCAGCACTAGCA 
AGCTCTGGAGGATGAGCCAGGAGTCTGGAATTGAGGCTGAGCCAAAGACCCCAGGGCCGTCTC 
AGTCTCATAAAAGGGGATCAGGCAGGAGGAGTTTGGGAGAAACCTGAGAAGGGCCTGATTTGC 
AGCATCATGATGGGCCTCTCCTTGGCCTCTGCTGTGCTCCTGGCCTCCCTCCTGAGTCTCCAC
CTTGGAACTGCCACACGTGGGAGTGACATATCCAAGACCTGCTGCTTCCAATACAGCCACAAG 
CCCCTTCCCTGGACCTGGGTGCGAAGCTATGAATTCACCAGTAACAGCTGCTCCCAGCGGGCT 
GTGATATTCACTACCAAAAGAGGCAAGAAAGTCTGTACCCATCCAAGGAAAAAATGGGTGCAA 
AAATACATTTCTTTACTGAAAACTCCGAAACAATTGTGACTCAGCTGAATTTTCATCCGAGGA 
CGCTTGGACCCCGCTCTTGGCTCTGCAGCCCTCTGGGGAGCCTGCGGAATCTTTTCTGAAGGC 
TACATGGACCCGCTGGGGAGGAGAGGGTGTTTCCTCCCAGAGTTACTTTAATAAAGGTTGTTC 
ATAGAGTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 322

MMGLSLASAVLLASLLSLHLGTATRGSDISKTCCFQYSHKPLPWTWVRSYEFTSNSCSQRAVI 
FTTKRGKKVCTHPRKKWVQKYISLLKTPKQL 

Important features: 
Signal peptide: 

amino acids 1-23 

N-myristoylation sites.

amino acids 3-9, 26-32 

Amidation site. 

amino acids 68-72 

Small cytokines (intercrine/chemokine).

amino acids 23-88 



FIGURE 323 

ACCGAGCCGAGCGGACCGAAGGCGCGCCCGAGATGCAGGTGAGCAAGAGGATGCTGGCGGGGGGCGTGAGGAGCA 
TGCCCAGCCCCCTCCTGGCCTGCTGGCAGCCCATCCTCCTGCTGGTGCTGGGCTCAGTGCTGTCAGGCTCGGCCA 
CGGGCTGCCCGCCCCGCTGCGAGTGCTCCGCCCAGGACCGCGCTGTGCTGTGCCACCGCAAGTGCTTTGTGGCAG 
TCCCCGAGGGCATCCCCACCGAGACGCGCCTGCTGGACCTAGGCAAGAACCGCATCAAAACGCTCAACCAGGACG 
AGTTCGCCAGCTTCCCGCACCTGGAGGAGCTGGAGCTCAACGAGAACATGGTGAGCGCCGTGGAGCCCGGCGCCT 
TCAACAACCTCTTCAACCTCCGGACGCTGGGTCTCCGCAGCAACCGCCTGAAGCTCATCCCGCTAGGCGTCTTCA 
CTGGCCTCAGCAACCTGACCAAGCAGGACATCAGCGAGAACAAGATCGTTATCCTACTGGACTACATGTTTCAGG 
ACCTGTACAACCTCAAGTCACTGGAGGTTGGCGACAATGACCTCGTCTACATCTCTCACCGCGCCTTCAGCGGCC 
TCAACAGCCTGGAGCAGCTGACGCTGGAGAAATGCAACCTGACCTCCATCCCCACCGAGGCGCTGTCCCACCTGC 
ACGGCCTCATCGTCCTGAGGCTCCGGCACCTCAACATCAATGCCATCCGGGACTACTCCTTCAAGAGGCTGTACC 
GACTCAAGGTCTTGGAGATCTCCCACTGGCCCTACTTGGACACCATGACACCCAACTGCCTCTACGGCCTCAACC 
TGACGTCCCTGTCCATCACACACTGCAATCTGACCGCTGTGCCCTACCTGGCCGTCCGCCACCTAGTCTATCTCC 
GCTTCCTCAACCTCTCCTACAACCCCATCAGCACCATTGAGGGCTCCATGTTGCATGAGCTGCTCCGGCTGCAGG 
AGATCCAGCTGGTGGGCGGGCAGCTGGCCGTGGTGGAGCCCTATGCCTTCCGCGGCCTCAACTACCTGCGCGTGC 
TCAATGTCTCTGGCAACCAGCTGACCACACTGGAGGAATCAGTCTTCCACTCGGTGGGCAACCTGGAGACACTCA 
TCCTGGACTCCAACCCGCTGGCCTGCGACTGTCGGCTCCTGTGGGTGTTCCGGCGCCGCTGGCGGCTCAACTTGA 
ACCGGCAGCAGCCCACGTGCGCCACGCCCGAGTTTGTCCAGGGCAAGGAGTTCAAGGACTTCCCTGATGTGCTAC 
TGCCCAACTACTTCACCTGCCGCCGCGCCCGCATCCGGGACCGCAAGGCCCAGCAGGTGTTTGTGGACGAGGGCC 
ACACGGTGCAGTTTGTGTGCCGGGCCGATGGCGACCCGCCGCCCGCCATCCTCTGGCTCTCACCCCGAAAGCACC 
TGGTCTCAGCCAAGAGCAATGGGCGGCTCACAGTCTTCCCTGATGGCACGCTGGAGGTGCGCTACGCCCAGGTAC 
AGGACAACGGCACGTACCTGTGCATCGCGGCCAACGCGGGCGGCAACGACTCCATGCCCGCCCACCTGCATGTGC 
GCAGCTACTCGCCCGACTGGCCCCATCAGCCCAACAAGACCTTCGCTTTCATCTCCAACCAGCCGGGCGAGGGAG 
AGGCCAACAGCACCCGCGCCACTGTGCCTTTCCCCTTCGACATCAAGACCCTCATCATCGCCACCACCATGGGCT 
TCATCTCTTTCCTGGGCGTCGTCCTCTTCTGCCTGGTGCTGCTGTTTCTCTGGAGCCGGGGCAAGGGCAACACAA 
AGCACAACATCGAGATCGAGTATGTGCCCCGAAAGTCGGACGCAGGCATCAGCTCCGCCGACGCGCCCCGCAAGT 
TCAACATGAAGATGATATGAGGCCGGGGCGGGGGGCAGGGACCCCCGGGCGGCCGGGCAGGGGAAGGGGCCTGGT 
CGCCACCTGCTCACTCTCCAGTCCTTCCCACCTCCTCCCTACCCTTCTACACACGTTCTCTTTCTCCCTCCCGCC 
TCCGTCCCCTGCTGCCCCCCGCCAGCCCTCACCACCTGCCCTCCTTCTACCAGGACCTCAGAAGCCCAGACCTGG 
GGACCCCACCTACACAGGGGCATTGACAGACTGGAGTTGAAAGCCGACGAACCGACACGCGGCAGAGTCAATAAT 
TCAATAAAAAAGTTACGAACTTTCTCTGTAACTTGGGTTTCAATAATTATGGATTTTTATGAAAACTTGAAATAA 
TAAAAAGAGAAAAAAACTAAAAAAAAAAAAAAAAAAAAAA* 
FIGURE 324

MQVSKRMLAGGVRSMPSPLLACWQPILLLVLGSVLSGSATGCPPRCECSAQDRAVLCHRKCFVAVPEGIPTETRL
LDLGKNRIKTLNQDEFASFPHLEELELNENIVSAVEPGAFNNLFNLRTLGLRSNRLKLIPLGVFTGLSNLTKQDI 
SENKIVILLDYMFQDLYNLKSLEVGDNDLVYISHRAFSGLNSLEQLTLEKCNLTSIPTEALSHLHGLIVLRLRHL 
NINAIRDYSFKRLYRLKVLEISHWPYLDTMTPNCLYGLNLTSLSITHCNLTAVPYLAVRHLVYLRFLNLSYNPIS
TIEGSMLHELLRLQEIQLVGGQLAVVEPYAFRGLNYLRVLNVSGNQLTTLEESVFHSVGNLETLILDSNPLACDC 
RLLWVFRRRWRLNFNRQQPTCATPEFVQGKEFKDFPDVLLPNYFTCRRARIRDRKAQQVFVDEGHTVQFVCRADG
DPPPAILWLSPRKHLVSAKSNGRLTVFPDGTLEVRYAQVQDNGTYLCIAANAGGNDSMPAHLHVRSYSPDWPHQP 
NKTFAFISNQPGEGEANSTRATVPFPFDIKTLIIATTMGFISFLGVVLFCLVLLFLWSRGKGNTKHNIEIEYVPR
KSDAGISSADAPRKFNMKMI 

Important features: 
Signal sequence: 

amino acids 1-41 

Transmembrane domain: 

amino acids 556-578 

N-glycosylation site. 

amino acids 144-148, 202-206, 264-268, 274-278, 293-297, 341-345, 492-496, 
505-509, 526-530, 542-546 

Casein kinase II phosphorylation site. 

amino acids 49-53, 108-112, 146-150, 300-304, 348-352, 349-353, 607-611 

Tyrosine kinase phosphorylation site. 

amino acids 590-598 

N-myristoylation site. 

amino acids 10-16, 32-38, 37-43, 113-119, 125-131, 137-143, 262-268, 320-326, 
344-350, 359-365, 493-499, 503-509, 605-611 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 32-43 
FIGURE 325

CCCACGCGTCCGCCCACGCGTCCGAGGGACAAGAGAGAAGAGAGACTGAAACAGGGAGAAGAG 
GCAGGAGAGGAGGAGGTGGGGAGAGCACGAAGCTGGAGGCCGACACTGAGGGAGGGCGGGAGG 
AGGTGAAGAAGGAGAGAGGGGAGAAGAGGCAGGAGCTGGAAAGGAGAGAGGGAGGAGGAGGAG 
GAGATGCGGGATGGAGACCTGGAGTTAGGTGGCTTGGGAGAGCTTAATGAAAAGAGAACGGAG 
AGGAGGTGTGGGTTAGGAACCAAGAGGTAGCCCTGTGGGCAGCAGAAGGCTGAGAGGAGTAGG 
AAGATCAGGAGCTAGAGGGAGACTGGAGGGTTCCGGGAAAAGAGCAGAGGAAAGAGGAAAGAC 
ACAGAGAGACGGGAGAGAGAAGAAGAGTGGGTTTGAAGGGCGGATCTCAGTCCCTGGCTGCTT 
TGGCATTTGGGGAACTGGGACTCCCTGTGGGGAGGAGAGGAAAGCTGGAAGTCCTGGAGGGAC 
AGGGTCCCAGAAGGAGGGGACAGAGGAGCTGAGAGAGGGGGGCAGGGCGTTGGGCAGGGGTCC 
CTCGGAGGCCTCCTGGGGATGGGGGCTGCAGCTCGTCTGAGCGCCCCTCGAGCGCTGGTACTC 
TGGGCTGCACTGGGGGCAGCAGCTCACATCGGACCAGCACCTGACCCCGAGGACTGGTGGAGC 
TACAAGGATAATCTCCAGGGAAACTTCGTGCCAGGGCCTCCTTTCTGGGGCCTGGTGAATGCA 
GCGTGGAGTCTGTGTGCTGTGGGGAAGCGGCAGAGCCCCGTGGATGTGGAGCTGAAGAGGGTT 
CTTTATGACCCCTTTCTGCCCCCATTAAGGCTCAGCACTGGAGGAGAGAAGCTCCGGGGAACC 
TTGTACAACACCGGCCGACATGTCTCCTTCCTGCCTGCACCCCGACCTGTGGTCAATGTGTCT 
GGAGGTCCCCTCCTTTACAGCCACCGACTCAGTGAACTGCGGCTGCTGTTTGGAGCTCGCGAC 
GGAGCCGGCTCGGAACATCAGATCAACCACCAGGGCTTCTCTGCTGAGGTGCAGCTCATTCAC 
TTCAACCAGGAACTCTACGGGAATTTCAGCGCTGCCTCCCGCGGCCCCAATGGCCTGGCCATT 
CTCAGCCTCTTTGTCAACGTTGCCAGTACCTCTAACCCATTCCTCAGTCGCCTCCTTAACCGC 
GACACCATCACTCGCATCTCCTACAAGAATGATGCCTACTTTCTTCAAGACCTGAGCCTGGAG 
CTCCTGTTCCCTGAATCCTTCGGCTTCATCACCTATCAGGGCTCTCTCAGCACCCCGCCCTGC 
TCCGAGACTGTCACCTGGATCCTCATTGACCGGGCCCTCAATATCACCTCCCTTCAGATGCAC 
TCCCTGAGACTCCTGAGCCAGAATCCTCCATCTCAGATCTTCCAGAGCCTCAGCGGTAACAGC 
CGGCCCCTGCAGCCCTTGGCCCACAGGGCACTGAGGGGCAACAGGGACCCCCGGCACCCCGAG 
AGGCGCTGCCGAGGCCCCAACTACCGCCTGCATGTGGATGGTGTCCCCCATGGTCGCTGAGAC
TCCCCTTCGAGGATTGCACCCGCCCGTCCTAAGCCTCCCCACAAGGCGAGGGGAGTTACCCCT 
AAAACAAAGCTATTAAAGGGACAGAATACTTA 
FIGURE 326

MGAAARLSAPRALVLWAALGAAAHIGPAPDPEDWWSYKDNLQGNFVPGPPFWGLVNAAWSLCA
VGKRQSPVDVELKRVLYDPFLPPLRLSTGGEKLRGTLYNTGRHVSFLPAPRPVVNVSGGPLLY
SHRLSELRLLFGARDGAGSEHQINHQGFSAEVQLIHFNQELYGNFSAASRGPNGLAILSLFVN 
VASTSNPFLSRLLNRDTITRISYKNDAYFLQDLSLELLFPESFGFITYQGSLSTPPCSETVTW 
ILIDRALNITSLQMHSLRLLSQNPPSQIFQSLSGNSRPLQPLAHRALRGNRDPRHPERRCRGP 
NYRLHVDGVPHGR 

Important features: 
Signal peptide: 

Amino acids 1-23 

Transmembrane domain: 

Amino acids 177-199 

N-glycosylation sites: 

Amino acids 118-122; 170-174; 260-264

Eukaryotic-type carbonic anhydrases proteins: 

Amino acids 222-271; 128-165; 45-93 
FIGURE 327

GGACTAATCTGTGGGAGCAGTTTATTCCAGTATCACCCAGGGTGCAGCCACACCAGGACTGTGTTGAAGGGTGTT 
TTTTTTCTTTTAAATGTAATACCTCCTCATCTTTTCTTCTTACACAGTGTCTGAGAACATTTACATTATAGATAA 
GTAGTACATGGTGGATAACTTCTACTTTTAGGAGGACTACTCTCTTCTGACAGTCCTAGACTGGTCTTCTACACT 
AAGACACCATGAAGGAGTATGTGCTCCTATTATTCCTGGCTTTGTGCTCTGCCAAACCCTTCTTTAGCCCTTCAC 
ACATCGCACTGAAGAATATGATGCTGAAGGATATGGAAGACACAGATGATGATGATGATGATGATGATGATGATG
ATGATGATGAGGACAACTCTCTTTTTCCAACAAGAGAGCCAAGAAGCCATTTTTTTCCATTTGATCTGTTTCCAA 
TGTGTCCATTTGGATGTCAGTGCTATTCACGAGTTGTACATTGCTCAGATTTAGGTTTGACCTCAGTCCCAACCA 
ACATTCCATTTGATACTCGAATGCTTGATCTTCAAAACAATAAAATTAAGGAAATCAAAGAAAATGATTTTAAAG
GACTCACTTCACTTTATGGTCTGATCCTGAACAACAACAAGCTAACGAAGATTCACCCAAAAGCCTTTCTAACCA 
CAAAGAAGTTGCGAAGGCTGTATCTGTCCCACAATCAACTAAGTGAAATACCACTTAATCTTCCCAAATCATTAG 
CAGAACTCAGAATTCATGAAAATAAAGTTAAGAAAATACAAAAGGACACATTCAAAGGAATGAATGCTTTACACG 
TTTTGGAAATGAGTGCAAACCCTCTTGATAATAATGGGATAGAGCCAGGGGCATTTGAAGGGGTGACGGTGTTCC 
ATATCAGAATTGCAGAAGCAAAACTGACCTCAGTTCCTAAAGGCTTACCACCAACTTTATTGGAGCTTCACTTAG 
ATTATAATAAAATTTCAACAGTGGAACTTGAGGATTTTAAACGATACAAAGAACTACAAAGGCTGGGCCTAGGAA
ACAACAAAATCACAGATATCGAAAATGGGAGTCTTGCTAACATACCACGTGTGAGAGAAATACATTTGGAAAACA 
ATAAACTAAAAAAAATCCCTTCAGGATTACCAGAGTTGAAATACCTCCAGATAATCTTCCTTCATTCTAATTCAA 
TTGCAAGAGTGGGAGTAAATGACTTCTGTCCAACAGTGCCAAAGATGAAGAAATCTTTATACAGTGCAATAAGTT 
TATTCAACAACCCGGTGAAATACTGGGAAATGCAACCTGCAACATTTCGTTGTGTTTTGAGCAGAATGAGTGTTC 
AGCTTGGGAACTTTGGAATGTAATAATTAGTAATTGGTAATGTCCATTTAATATAAGATTCAAAAATCCCTACAT 
TTGGAATACTTGAACTCTATTAATAATGGTAGTATTATATATACAAGCAAATATCTATTCTCAAGTGGTAAGTCC 
ACTGACTTATTTTATGACAAGAAATTTCAACGGAATTTTGCCAAACTATTGATACATAAGGGGTTGAGAGAAACA 
AGCATCTATTGCAGTTTCCTTTTTGCGTACAAATGATCTTACATAAATCTCATGCTTGACCATTCCTTTCTTCAT 
AACAAAAAAGTAAGATATTCGGTATTTAACACTTTGTTATCAAGCACATTTTAAAAAGAACTGTACTGTAAATGG 
AATGCTTGACTTAGCAAAATTTGTGCTCTTTCATTTGCTGTTAGAAAAACAGAATTAACAAAGACAGTAATGTGA 
AGAGTGCATTACACTATTCTTATTCTTTAGTAACTTGGGTAGTACTGTAATATTTTTAATCATCTTAAAGTATGA 
TTTGATATAATCTTATTGAAATTACCTTATCATGTCTTAGAGCCCGTCTTTATGTTTAAAACTAATTTCTTAAAA 
TAAAGCCTTCAGTAAATGTTCATTACCAACTTGATAAATGCTACTCATAAGAGCTGGTTTGGGGCTATAGCATAT 
GCTTTTTTTTTTTTAATTATTACCTGATTTAAAAATCTCTGTAAAAACGTGTAGTGTTTCATAAAATCTGTAACT 
CGCATTTTAATGATCCGCTATTATAAGCTTTTAATAGCATGAAAATTGTTAGGCTATATAACATTGCCACTTCAA 
CTCTAAGGAATATTTTTGAGATATCCCTTTGGAAGACCTTGCTTGGAAGAGCCTGGACACTAACAATTCTACACC 
AAATTGTCTCTTCAAATACGTATGGACTGGATAACTCTGAGAAACACATCTAGTATAACTGAATAAGCAGAGCAT 
CAAATTAAACAGACAGAAACCGAAAGCTCTATATAAATGCTCAGAGTTCTTTATGTATTTCTTATTGGCATTCAA 
CATATGTAAAATCAGAAAACAGGGAAATTTTCATTAAAAATATTGGTTTGAAAT 



WO 01/40466 



PCT/US00/32678 



328/550 

FIGURE 328 

MKEYVLLLFLALCSAKPFFSPSHIALKNMMLKDMEDTDDDDDDDDDDDDDEDNSLFPTREPRS 
HFFPFDLFPMCPFGCQCYSRVVHCSDLGLTSVPTNIPFDTRMLDLQNNKIKEIKENDFKGLTS 
LYGLILNNNKLTKIHPKAFLTTKKLRRLYLSHNQLSEIPLNLPKSLAELRIHENKVKKIQKDT 
FKGMNALHVLEMSANPLDNNGIEPGAFEGVTVFHIRIAEAKLTSVPKGLPPTLLELHLDYNKI 
STVELEDFKRYKELQRLGLGNNKITDIENGSLANIPRVREIHLENNKLKKIPSGLPELKYLQI 
IFLHSNSIARVGVNDFCPTVPKMKKSLYSAISLFNNPVKYWEMQPATFRCVLSRMSVQLGNFGM 

Important features: 
Signal sequence. 

amino acids 1-15 

N-glycosylation site. 

amino acids 281-285 

N-myristoylation sites. 

amino acids 129-135, 210-216, 214-220, 237-243, 270-276, 282-288 

Leucine zipper pattern. 

amino acids 154-176 
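
The feature positions listed for Figure 328 can be cross-checked against the sequence itself. The following is a minimal sketch, assuming the N-glycosylation annotation follows the PROSITE-style consensus N-{P}-[ST]-{P}; the names `FIG328`, `N_GLYC` and `motif_starts` are illustrative and do not come from the original document.

```python
import re

# Figure 328 amino acid sequence, joined from the six lines of the figure.
FIG328 = (
    "MKEYVLLLFLALCSAKPFFSPSHIALKNMMLKDMEDTDDDDDDDDDDDDDEDNSLFPTREPRS"
    "HFFPFDLFPMCPFGCQCYSRVVHCSDLGLTSVPTNIPFDTRMLDLQNNKIKEIKENDFKGLTS"
    "LYGLILNNNKLTKIHPKAFLTTKKLRRLYLSHNQLSEIPLNLPKSLAELRIHENKVKKIQKDT"
    "FKGMNALHVLEMSANPLDNNGIEPGAFEGVTVFHIRIAEAKLTSVPKGLPPTLLELHLDYNKI"
    "STVELEDFKRYKELQRLGLGNNKITDIENGSLANIPRVREIHLENNKLKKIPSGLPELKYLQI"
    "IFLHSNSIARVGVNDFCPTVPKMKKSLYSAISLFNNPVKYWEMQPATFRCVLSRMSVQLGNFGM"
)

# Assumed consensus: Asn, any residue except Pro, Ser or Thr, any residue
# except Pro. The lookahead lets overlapping candidates be reported too.
N_GLYC = re.compile(r"(?=N[^P][ST][^P])")


def motif_starts(seq, pattern):
    """Return the 1-based start position of every match of pattern in seq."""
    return [m.start() + 1 for m in pattern.finditer(seq)]


if __name__ == "__main__":
    print(motif_starts(FIG328, N_GLYC))
```

Under this assumption, residue 281 (the start of the listed range 281-285) should appear among the reported positions.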
FIGURE 329

GGGGTCTCCCTCAGGGCCGGGAGGCACAGCGGTCCCTGCTTGCTGAAGGGCTGGATGTACGCA 
TCCGCAGGTTCCCGCGGACTTGGGGGCGCCCGCTGAGCCCCGGCGCCCGCAGAAGACTTGTGT 
TTGCCTCCTGCAGCCTCAACCCGGAGGGCAGCGAGGGCCTACCACCATGATCACTGGTGTGTT 
CAGCATGCGCTTGTGGACCCCAGTGGGCGTCCTGACCTCGCTGGCGTACTGCCTGCACCAGCG 
GCGGGTGGCCCTGGCCGAGCTGCAGGAGGCCGATGGCCAGTGTCCGGTCGACCGCAGCCTGCT 
GAAGTTGAAAATGGTGCAGGTCGTGTTTCGACACGGGGCTCGGAGTCCTCTCAAGCCGCTCCC 
GCTGGAGGAGCAGGTAGAGTGGAACCCCCAGCTATTAGAGGTCCCACCCCAAACTCAGTTTGA 
TTACACAGTCACCAATCTAGCTGGTGGTCCGAAACCATATTCTCCTTACGACTCTCAATACCA 
TGAGACCACCCTGAAGGGGGGCATGTTTGCTGGGCAGCTGACCAAGGTGGGCATGCAGCAAAT 
GTTTGCCTTGGGAGAGAGACTGAGGAAGAACTATGTGGAAGACATTCCCTTTCTTTCACCAAC 
CTTCAACCCACAGGAGGTCTTTATTCGTTCCACTAACATTTTTCGGAATCTGGAGTCCACCCG 
TTGTTTGCTGGCTGGGCTTTTCCAGTGTCAGAAAGAAGGACCCATCATCATCCACACTGATGA 
AGCAGATTCAGAAGTCTTGTATCCCAACTACCAAAGCTGCTGGAGCCTGAGGCAGAGAACCAG 
AGGCCGGAGGCAGACTGCCTCTTTACAGCCAGGAATCTCAGAGGATTTGAAAAAGGTGAAGGA 
CAGGATGGGCATTGACAGTAGTGATAAAGTGGACTTCTTCATCCTCCTGGACAACGTGGCTGC 
CGAGCAGGCACACAACCTCCCAAGCTGCCCCATGCTGAAGAGATTTGCACGGATGATCGAACA 
GAGAGCTGTGGACACATCCTTGTACATACTGCCCAAGGAAGACAGGGAAAGTCTTCAGATGGC 
AGTAGGCCCATTCCTCCACATCCTAGAGAGCAACCTGCTGAAAGCCATGGACTCTGCCACTGC 
CCCCGACAAGATCAGAAAGCTGTATCTCTATGCGGCTCATGATGTGACCTTCATACCGCTCTT 
AATGACCCTGGGGATTTTTGACCACAAATGGCCACCGTTTGCTGTTGACCTGACCATGGAACT 
TTACCAGCACCTGGAATCTAAGGAGTGGTTTGTGCAGCTCTATTACCACGGGAAGGAGCAGGT 
GCCGAGAGGTTGCCCTGATGGGCTCTGCCCGCTGGACATGTTCTTGAATGCCATGTCAGTTTA 
TACCTTAAGCCCAGAAAAATACCATGCACTCTGCTCTCAAACTCAGGTGATGGAAGTTGGAAA 
TGAAGAG^ACTGATTTATAAAAGCAGGATGTGTTGATTTTAAAATAAAGTGCCTTTATACAATG 
FIGURE 330

MITGVFSMRLWTPVGVLTSLAYCLHQRRVALAELQEADGQCPVDRSLLKLKMVQVVFRHGARSPLKPLPLEEQVE
WNPQLLEVPPQTQFDYTVTNLAGGPKPYSPYDSQYHETTLKGGMFAGQLTKVGMQQMFALGERLRKNYVEDIPFL 
SPTFNPQEVFIRSTNIFRNLESTRCLLAGLFQCQKEGPIIIHTDEADSEVLYPNYQSCWSLRQRTRGRRQTASLQ 
PGISEDLKKVKDRMGIDSSDKVDFFILLDNVAAEQAHNLPSCPMLKRFARMIEQRAVDTSLYILPKEDRESLQMA 
VGPFLHILESNLLKAMDSATAPDKIRKLYLYAAHDVTFIPLLMTLGIFDHKWPPFAVDLTMELYQHLESKEWFVQ 
LYYHGKEQVPRGCPDGLCPLDMFLNAMSVYTLSPEKYHALCSQTQVMEVGNEE 

Important features: 
Signal sequence:

amino acids 1-23 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 218-222 

Casein kinase II phosphorylation site. 

amino acids 87-91, 104-108, 320-324 

Tyrosine kinase phosphorylation site. 

amino acids 280-288 

N-myristoylation site. 

amino acids 15-21, 117-123, 118-124, 179-185, 240-246, 387-393 

Amidation site. 

amino acids 216-220 

Leucine zipper pattern. 

amino acids 10-32 

Histidine acid phosphatases phosphohistidine signature. 

amino acids 50-65 
FIGURE 331

CGAGGGCTTTTCCGGCTCCGGAATGGCACATGTGGGAATCCCAGTCTTGTTGGCTACAACATTTTTCCCTTTCCT 
AACAAGTTCTAACAGCTGTTCTAACAGCTAGTGATCAGGGGTTCTTCTTGCTGGAGAAGAAAGGGCTGAGGGCAG
AGCAGGGCACTCTCACTCAGGGTGACCAGCTCCTTGCCTCTCTGTGGATAACAGAGCATGAGAAAGTGAAGAGAT 
GCAGCGGAGTGAGGTGATGGAAGTCTAAAATAGGAAGGAATTTTGTGTGCAATATCAGACTCTGGGAGCAGTTGA 
CCTGGAGAGCCTGGGGGAGGGCCTGCCTAACAAGCTTTCAAAAAACAGGAGCGACTTCCACTGGGCTGGGATAAG 
ACGTGCCGGTAGGATAGGGAAGACTGGGTTTAGTCCTAATATCAAATTGACTGGCTGGGTGAACTTCAACAGCCT 
TTTAACCTCTCTGGGAGATGAAAACGATGGGTTAAGGGGCCAGAAATAGAGATGCTTTGTAAAATAAAATTTTAA 
AAAAAGCAAGTATTTTATAGCATAAAGGCTAGAGACCAAAATAGATAACAGGATTCCCTGAACATTCCTAAGAGG 
GAGAAAGTATGTTAAAAATAGAAAAACCAAAATGCAGAAGGAGGAGACTCACAGAGCTAAACCAGGATGGGGACC 
CTGGGTCAGGCCAGCCTCTTTGCTCCTCCCGGAAATTATTTTTGGTCTGACCACTCTGCCTTGTGTTTTGCAGAA 
TCATGTGAGGGCCAACCGGGGAAGGTGGAGCAGATGAGCACACACAGGAGCCGTCTCCTCACCGCCGCCCCTCTC 
AGCATGGAACAGAGGCAGCCCTGGCCCCGGGCCCTGGAGGTGGACAGCCGCTCTGTGGTCCTGCTCTCAGTGGTC 
TGGGTGCTGCTGGCCCCCCCAGCAGCCGGCATGCCTCAGTTCAGCACCTTCCACTCTGAGAATCGTGACTGGACC 
TTCAACCACTTGACCGTCCACCAAGGGACGGGGGCCGTCTATGTGGGGGCCATCAACCGGGTCTATAAGCTGACA 
GGCAACCTGACCATCCAGGTGGCTCATAAGACAGGGCCAGAAGAGGACAACAAGTCTCGTTACCCGCCCCTCATC 
GTGCAGCCCTGCAGCGAAGTGCTCACCCTCACCAACAATGTCAACAAGCTGCTCATCATTGACTACTCTGAGAAC 
CGCCTGCTGGCCTGTGGGAGCCTCTACCAGGGGGTCTGCAAGCTGCTGCGGCTGGATGACCTCTTCATCCTGGTG 
GAGCCATCCCACAAGAAGGAGCACTACCTGTCCAGTGTCAACAAGACGGGCACCATGTACGGGGTGATTGTGCGC 
TCTGAGGGTGAGGATGGCAAGCTCTTCATCGGCACGGCTGTGGATGGGAAGCAGGATTACTTCCCGACCCTGTCC 
AGCCGGAAGCTGCCCCGAGACCCTGAGTCCTCAGCCATGCTCGACTATGAGCTACACAGCGATTTTGTCTCCTCT 
CTCATCAAGATCCCTTCAGACACCCTGGCCCTGGTCTCCCACTTTGACATCTTCTACATCTACGGCTTTGCTAGT 
GGGGGCTTTGTCTACTTTCTCACTGTCCAGCCCGAGACCCCTGAGGGTGTGGCCATCAACTCCGCTGGAGACCTC 
TTCTACACCTCACGCATCGTGCGGCTCTGCAAGGATGACCCCAAGTTCCACTCATACGTGTCCCTGCCCTTCGGC 
TGCACCCGGGCCGGGGTGGAATACCGCCTCCTGCAGGCTGCTTACCTGGCCAAGCCTGGGGACTCACTGGCCCAG
GCCTTCAATATCACCAGCCAGGACGATGTACTCTTTGCCATCTTCTCCAAAGGGCAGAAGCAGTATCACCACCCG 
CCCGATGACTCTGCCCTGTGTGCCTTCCCTATCCGGGCCATCAACTTGCAGATCAAGGAGCGCCTGCAGTCCTGC 
TACCAGGGCGAGGGCAACCTGGAGCTCAACTGGCTGCTGGGGAAGGACGTCCAGTGCACGAAGGCGCCTGTCCCC 
ATCGATGATAACTTCTGTGGACTGGACATCAACCAGCCCCTGGGAGGCTCAACTCCAGTGGAGGGCCTGACCCTG 
TACACCACCAGCAGGGACCGCATGACCTCTGTGGCCTCCTACGTTTACAACGGCTACAGCGTGGTTTTTGTGGGG 
ACTAAGAGTGGCAAGCTGAAAAAGGTAAGAGTCTATGAGTTCAGATGCTCCAATGCCATTCACCTCCTCAGCAAA
GAGTCCCTCTTGGAAGGTAGCTATTGGTGGAGATTTAACTATAGGCAACTTTATTTTCTTGGGGAACAAAGGTa 
AATGGGGAGGTAAGAAGGGGTTAATTTTGTGACTTAGCTTCTAGCTACTTCCTCCAGCCATCAGTCATTGGGTAT 
GTAAGGAATGCAAGCGTATTTCAATATTTCCCAAACTTTAAGAAAAAACTTTAAGAAGGTACATCTGCAAAAGCAAA 
FIGURE 332

MGTLGQASLFAPPGNYFWSDHSALCFAESCEGQPGKVEQMSTHRSRLLTAAPLSMEQRQPWPR 
ALEVDSRSVVLLSVVWVLLAPPAAGMPQFSTFHSENRDWTFNHLTVHQGTGAVYVGAINRVYK
LTGNLTIQVAHKTGPEEDNKSRYPPLIVQPCSEVLTLTNNVNKLLIIDYSENRLLACGSLYQG 
VCKLLRLDDLFILVEPSHKKEHYLSSVNKTGTMYGVIVRSEGEDGKLFIGTAVDGKQDYFPTL 
SSRKLPRDPESSAMLDYELHSDFVSSLIKIPSDTLALVSHFDIFYIYGFASGGFVYFLTVQPE 
TPEGVAINSAGDLFYTSRIVRLCKDDPKFHSYVSLPFGCTRAGVEYRLLQAAYLAKPGDSLAQ 
AFNITSQDDVLFAIFSKGQKQYHHPPDDSALCAFPIRAINLQIKERLQSCYQGEGNLELNWLL 
GKDVQCTKAPVPIDDNFCGLDINQPLGGSTPVEGLTLYTTSRDRMTSVASYVYNGYSVVFVGT
KSGKLKKVRVYEFRCSNAIHLLSKESLLEGSYWWRFNYRQLYFLGEQR 

Important features: 
Signal sequence: 

amino acids 1-32 

Transmembrane domain: 

amino acids 71-87 

N-glycosylation site.

amino acids 130-134, 145-149, 217-221, 381-385

Casein kinase II phosphorylation site.

amino acids 139-143, 229-233, 240-244, 291-295, 324-328, 383-387,
384-388, 471-475, 481-485, 530-534

N-myristoylation site. 

amino acids 220-226, 319-325, 353-359, 460-466, 503-509 
FIGURE 333

GCTGAGTCTGCTGCTCCTGCTGCTGCTGCTCCAGCCTGTAACCTGTGCCTACACCACGCCAGG 
CCCCCCCAGAGCCCTCACCACGCTGGGCGCCCCCAGAGCCCACACCATGCCGGGCACCTACGC
TCCCTCGACCACACTCAGTAGTCCCAGCACCCAGGGCCTGCAAGAGCAGGCACGGGCCCTGAT 
GCGGGACTTCCCGCTCGTGGACGGCCACAACGACCTGCCCCTGGTCCTAAGGCAGGTTTACCA 
GAAAGGGCTACAGGATGTTAACCTGCGCAATTTCAGCTACGGCCAGACCAGCCTGGACAGGCT 
TAGAGATGGCCTCGTGGGCGCCCAGTTCTGGTCAGCCTATGTGCCATGCCAGACCCAGGACCG 
GGATGCCCTGCGCCTCACCCTGGAGCAGATTGACCTCATACGCCGCATGTGTGCCTCCTATTC 
TGAGCTGGAGCTTGTGACCTCGGCTAAAGCTCTGAACGACACTCAGAAATTGGCCTGCCTCAT 
CGGTGTAGAGGGTGGCCACTCGCTGGACAATAGCCTCTCCATCTTACGTACCTTCTACATGCT 
GGGAGTGCGCTACCTGACGCTCACCCACACCTGCAACACACCCTGGGCAGAGAGCTCCGCTAA 
GGGCGTCCACTCCTTCTACAACAACATCAGCGGGCTGACTGACTTTGGTGAGAAGGTGGTGGC 
AGAAATGAACCGCCTGGGCATGATGGTAGACTTATCCCATGTCTCAGATGCTGTGGCACGGCG 
GGCCCTGGAAGTGTCACAGGCACCTGTGATCTTCTCCCACTCGGCTGCCCGGGGTGTGTGCAA 
CAGTGCTCGGAATGTTCCTGATGACATCCTGCAGCTTCTGAAGAAGAACGGTGGCGTCGTGAT 
GGTGTCTTTGTCCATGGGAGTAATACAGTGCAACCCATCAGCCAATGTGTCCACTGTGGCAGA 
TCACTTCGACCACATCAAGGCTGTCATTGGATCCAAGTTCATCGGGATTGGTGGAGATTATGA 
TGGGGCCGGCAAATTCCCTCAGGGGCTGGAAGACGTGTCCACATACCCGGTCCTGATAGAGGA 
GTTGCTGAGTCGTGGCTGGAGTGAGGAAGAGCTTCAGGGTGTCCTTCGTGGAAACCTGCTGCG 
GGTCTTCAGACAAGTGGAAAAGGTACAGGAAGAAAACAAATGGCAAAGCCCCTTGGAGGACAA 
GTTCCCGGATGAGCAGCTGAGCAGTTCCTGCCACTCCGACCTCTCACGTCTGCGTCAGAGACA 
GAGTCTGACTTCAGGCCAGGAACTCACTGAGATTCCCATACACTGGACAGCCAAGTTACCAGC 
CAAGTGGTCAGTCTCAGAGTCCTCCCCCCACATGGCCCCAGTCCTTGCAGTTGTGGCCACCTT 
CCCAGTCCTTATTCTGTGGCTCTGATGACCCAGTTAGTCCTGCCAGATGTCACTGTAGCAAGC 
CACAGACACCCCACAAAGTTCCCCTGTTGTGCAGGCACAAATATTTCCTGAAATAAATGTTTT 
GGACATAG 
FIGURE 334

MPGTYAPSTTLSSPSTQGLQEQARALMRDFPLVDGHNDLPLVLRQVYQKGLQDVNLRNFSYGQ 
TSLDRLRDGLVGAQFWSAYVPCQTQDRDALRLTLEQIDLIRRMCASYSELELVTSAKALNDTQ 
KLACLIGVEGGHSLDNSLSILRTFYMLGVRYLTLTHTCNTPWAESSAKGVHSFYNNISGLTDF 
GEKVVAEMNRLGMMVDLSHVSDAVARRALEVSQAPVIFSHSAARGVCNSARNVPDDILQLLKK
NGGVVMVSLSMGVIQCNPSANVSTVADHFDHIKAVIGSKFIGIGGDYDGAGKFPQGLEDVSTY
PVLIEELLSRGWSEEELQGVLRGNLLRVFRQVEKVQEENKWQSPLEDKFPDEQLSSSCHSDLS
RLRQRQSLTSGQELTEIPIHWTAKLPAKWSVSESSPHMAPVLAVVATFPVLILWL

Important features: 
N-glycosylation sites. 

amino acids 58-62, 123-127, 182-186, 273-277

N-myristoylation sites.

amino acids 72-78, 133-139, 234-240, 264-270, 334-340, 389-395 

Renal dipeptidase active site. 

amino acids 134-157 
FIGURE 335

CCCAGAAGTTCAAGGGCCCCCGGCCTCCTGCGCTCCTGCCGCCGGGACCCTCGACCTCCTCAG 
AGCAGCCGGCTGCCGCCCCGGGAAGATGGCGAGGAGGAGCCGCCACCGCCTCCTCCTGCTGCT
GCTGCGCTACCTGGTGGTCGCCCTGGGCTATCATAAGGCCTATGGGTTTTCTGCCCCAAAAGA 
CCAACAAGTAGTCACAGCAGTAGAGTACCAAGAGGCTATTTTAGCCTGCAAAACCCCAAAGAA 
GACTGTTTCCTCCAGATTAGAGTGGAAGAAACTGGGTCGGAGTGTCTCCTTTGTCTACTATCA 
ACAGACTCTTCAAGGTGATTTTAAAAATCGAGCTGAGATGATAGATTTCAATATCCGGATCAA 
AAATGTGACAAGAAGTGATGCGGGGAAATATCGTTGTGAAGTTAGTGCCCCATCTGAGCAAGG 
CCAAAACCTGGAAGAGGATACAGTCACTCTGGAAGTATTAGTGGCTCCAGCAGTTCCATCATG 
TGAAGTACCCTCTTCTGCTCTGAGTGGAACTGTGGTAGAGCTACGATGTCAAGACAAAGAAGG 
GAATCCAGCTCCTGAATACACATGGTTTAAGGATGGCATCCGTTTGCTAGAAAATCCCAGACT 
TGGCTCCCAAAGCACCAACAGCTCATACACAATGAATACAAAAACTGGAACTCTGCAATTTAA 
TACTGTTTCCAAACTGGACACTGGAGAATATTCCTGTGAAGCCCGCAATTCTGTTGGATATCG 
CAGGTGTCCTGGGAAACGAATGCAAGTAGATGATCTCAACATAAGTGGCATCATAGCAGCCGT 
AGTAGTTGTGGCCTTAGTGATTTCCGTTTGTGGCCTTGGTGTATGCTATGCTCAGAGGAAAGG 
CTACTTTTCAAAAGAAACCTCCTTCCAGAAGAGTAATTCTTCATCTAAAGCCACGACAATGAG 
TGAAAATGTGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGAAGGCCGCGGCGGGCGGATC 
ACGAGGTCAGGAGTTCTAGACCAGTCTGGCCAATATGGTGAAACCCCATCTCTACTAAAATAC 
AAAAATTAGCTGGGCATGGTGGCATGTGCCTGCAGTTCCAGCTGCTTGGGAGACAGGAGAATC 
ACTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCTGAGATCACGCCACTGCAGTCCAGCCTGGG 
TAACAGAGCAAGATTCCATCTCAAAAAATAAAATAAATAAATAAATAAATACTGGTTTTTACC
TGTAGAATTCTTACAATAAATATAGCTTGATATTC 
FIGURE 336

MARRSRHRLLLLLLRYLVVALGYHKAYGFSAPKDQQVVTAVEYQEAILACKTPKKTVSSRLEW
KKLGRSVSFVYYQQTLQGDFKNRAEMIDFNIRIKNVTRSDAGKYRCEVSAPSEQGQNLEEDTV
TLEVLVAPAVPSCEVPSSALSGTVVELRCQDKEGNPAPEYTWFKDGIRLLENPRLGSQSTNSS
YTMNTKTGTLQFNTVSKLDTGEYSCEARNSVGYRRCPGKRMQVDDLNISGIIAAVVVVALVIS
VCGLGVCYAQRKGYFSKETSFQKSNSSSKATTMSENVQWLTPVIPALWKAAAGGSRGQEF 

Important features:
Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 130-144, 238-258 

N-glycosylation site. 

amino acids 98-102, 187-191, 236-240, 277-281

Casein kinase II phosphorylation site.

amino acids 39-43, 59-63, 100-104, 149-153, 205-209, 284-288

N-myristoylation site.

amino acids 182-188, 239-245, 255-261, 257-263, 305-311 

Amidation site. 

amino acids 226-230 
FIGURE 337

GGAGCCGCCCTGGGTGTCAGCGGCTCGGCTCCCGCGCACGCTCCGGCCGTCGCGCAGCCTCGG 
CACCTGCAGGTCCGTGCGTCCCGCGGCTGGCGCCCCTGACTCCGTCCCGGCCAGGGAGGGCCA 
TGATTTCCCTCCCGGGGCCCCTGGTGACCAACTTGCTGCGGTTTTTGTTCCTGGGGCTGAGTG 
CCCTCGCGCCCCCCTCGCGGGGCCAGCTGCAACTGCACTTGCCCGCCAACCGGTTGCAGGCGG 
TGGAGGGAGGGGAAGTGGTGCTTCCAGCGTGGTACACCTTGCACGGGGAGGTGTCTTCATCCC 
AGCCATGGGAGGTGCCCTTTGTGATGTGGTTCTTCAAACAGAAAGAAAAGGAGGATCAGGTGT 
TGTCCTACATCAATGGGGTCACAACAAGCAAACCTGGAGTATCCTTGGTCTACTCCATGCCCT 
CCCGGAACCTGTCCCTGCGGCTGGAGGGTCTCCAGGAGAAAGACTCTGGCCCCTACAGCTGCT 
CCGTGAATGTGCAAGAGAAACAAGGCAAATCTAGGGGCCACAGCATCAAAACCTTAGAACTCA 
ATGTACTGGTTCCTCCAGCTCCTCCATCCTGCCGTCTCCAGGGTGTGCCCCATGTGGGGGCAA 
ACGTGACCCTGAGCTGCCAGTCTCCAAGGAGTAAGCCCGCTGTCCAATACCAGTGGGATCGGC 
AGCTTCCATCCTTCCAGACTTTCTTTGCACCAGCATTAGATGTCATCCGTGGGTCTTTAAGCC 
TCACCAACCTTTCGTCTTCCATGGCTGGAGTCTATGTCTGCAAGGCCCACAATGAGGTGGGCA 
CTGCCCAATGTAATGTGACGCTGGAAGTGAGCACAGGGCCTGGAGCTGCAGTGGTTGCTGGAG 
CTGTTGTGGGTACCCTGGTTGGACTGGGGTTGCTGGCTGGGCTGGTCCTCTTGTACCACCGCC 
GGGGCAAGGCCCTGGAGGAGCCAGCCAATGATATCAAGGAGGATGCCATTGCTCCCCGGACCC 
TGCCCTGGCCCAAGAGCTCAGACACAATCTCCAAGAATGGGACCCTTTCCTCTGTCACCTCCG 
CACGAGCCCTCCGGCCACCCCATGGCCCTCCCAGGCCTGGTGCATTGACCCCCACGCCCAGTC
TCTCCAGCCAGGCCCTGCCCTCACCAAGACTGCCCACGACAGATGGGGCCCACCCTCAACCAA 
TATCCCCCATCCCTGGTGGGGTTTCTTCCTCTGGCTTGAGCCGCATGGGTGCTGTGCCTGTGA 
TGGTGCCTGCCCAGAGTCAAGCTGGCTCTCTGGTATGATGACCCCAGCACTCATTGGCTAAAG 
GATTTGGGGTCTCTCCTTCCTATAAGGGTCACCTCTAGCACAGAGGCCTGAGTCATGGGAAAG 
AGTCACACTCCTGACCCTTAGTACTCTGCCCCCACCTCTCTTTACTGTGGGAAAACCATCTCA 
GTAAGACCTAAGTGTCCAGGAGACAGAAGGAGAAGAGGAAGTGGATCTGGAATTGGGAGGAGC 
CTCCACCCACCCCTGACTCCTCCTTATGAAGCCAGCTGCTGAAATTAGCTACTCACCAAGAGT 
GAGGGGCAGAGACTTCCAGTCACTGAGTCTCCCAGGCCCCCTTGATCTGTACCCCACCCCTAT 
CTAACACCACCCTTGGCTCCCACTCCAGCTCCCTGTATTGATATAACCTGTCAGGCTGGCTTG 
GTTAGGTTTTACTGGGGCAGAGGATAGGGAATCTCTTATTAAAACTAACATGAAATATGTGTT 
GTTTTCATTTGCAAATTTAAATAAAGATACATAATGTTTGTATGAAAAA 
FIGURE 338

MISLPGPLVTNLLRFLFLGLSALAPPSRAQLQLHLPANRLQAVEGGEVVLPAWYTLHGEVSSS 
QPWEVPFVMWFFKQKEKEDQVLSYINGVTTSKPGVSLVYSMPSRNLSLRLEGLQEKDSGPYSC 
SVNVQDKQGKSRGHSIKTLELNVLVPPAPPSCRLQGVPHVGANVTLSCQSPRSKPAVQYQWDR 
QLPSFQTFFAPALDVIRGSLSLTNLSSSMAGVYVCKAHNEVGTAQCNVTLEVSTGPGAAVVAG 
AVVGTLVGLGLLAGLVLLYHRRGKALEEPANDIKEDAIAPRTLPWPKSSDTISKNGTLSSVTS
ARALRPPHGPPRPGALTPTPSLSSQALPSPRLPTTDGAHPQPISPIPGGVSSSGLSRMGAVPV 
MVPAQSQAGSLV 

Important features: 
Signal peptide: 

amino acids 1-29 

Transmembrane domain:

amino acids 245-267 

N-glycosylation site. 

amino acids 108-112, 169-173, 213-217, 236-240, 307-311

N-myristoylation site.

amino acids 90-96, 167-173, 220-226, 231-237, 252-258, 256-262, 
262-268, 308-314, 363-369, 364-370 

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 164-175 



FIGURE 339 

GCGAGAACCTTTGCACGCGCACAAACTACGGGGACGATTTCTGATTGATTTTTGGCGCTTTCGATCCACCCTCCT 
CCCTTCTCATGGGACTTTGGGGACAAAGCGTCCCGACCGCCTCGAGCGCTCGAGCAGGGCGCTATCCAGGAGCCA 
GGACAGCGTCGGGAACCAGACCATGGCTCCTGGACCCCAAGATCCTTAAGTTCGTCGTCTTCATCGTCGCGGTTC 
TGCTGCCGGTCCGGGTTGACTCTGCCACCATCCCCCGGCAGGACGAAGTTCCCCAGCAGACAGTGGCCCCACAGC 
AACAGAGGCGCAGCCTCAAGGAGGAGGAGTGTCCAGCAGGATCTCATAGATCAGAATATACTGGAGCCTGTAACC 
CGTGCACAGAGGGTGTGGATTACACCATTGCTTCCAACAATTTGCCTTCTTGCCTGCTATGTACAGTTTGTAAAT 
CAGGTCAAACAAATAAAAGTTCCTGTACCACGACCAGAGACACCGTGTGTCAGTGTGAAAAAGGAAGCTTCCAGG 
ATAAAAACTCCCCTGAGATGTGCCGGACGTGTAGAACAGGGTGTCCCAGAGGGATGGTCAAGGTCAGTAATTGTA 
CGCCCCGGAGTGACATCAAGTGCAAAAATGAATCAGCTGCCAGTTCCACTGGGAAAACCCCAGCAGCGGAGGAGA 
CAGTGACCACCATCCTGGGGATGCTTGCCTCTCCCTATCACTACCTTATCATCATAGTGGTTTTAGTCATCATTT 
TAGCTGTGGTTGTGGTTGGCTTTTCATGTCGGAAGAAATTCATTTCTTACCTCAAAGGCATCTGCTCAGGTGGTG 
GAGGAGGTCCCGAACGTGTGCACAGAGTCCTTTTCCGGCGGCGTTCATGTCCTTCACGAGTTCCTGGGGCGGAGG 
ACAATGCCCGCAACGAGACCCTGAGTAACAGATACTTGCAGCCCACCCAGGTCTCTGAGCAGGAAATCCAAGGTC 
AGGAGCTGGCAGAGCTAACAGGTGTGACTGTAGAGTCGCCAGAGGAGCCACAGCGTCTGCTGGAACAGGCAGAAG 
CTGAAGGGTGTCAGAGGAGGAGGCTGCTGGTTCCAGTGAATGACGCTGACTCCGCTGACATCAGCACCTTGCTGG 
ATGCCTCGGCAACACTGGAAGAAGGACATGCAAAGGAAACAATTCAGGACCAACTGGTGGGCTCCGAAAAGCTCT 
TTTATGAAGAAGATGAGGCAGGCTCTGCTACGTCCTGCCTGT^AAGAATCTCTTCAGGAAACCAGAGCTTCCCT 
CATTTACCTTTTCTCCTACAAAGGGAAGCAGCCTGGAAGAAACAGTCCAGTACTTGACCCATGCCCCAACAAACT 
CTACTATCCAATATGGGGCAGCTTACCAATGGTCCTAGAACTTTGTTAACGCACTTGGAGTAATTTTTATGAAAT 
ACTGCGTGTGATAAGCAAACGGGAGAAATTTATATCAGATTCTTGGCTGCATAGTTATACGATTGTGTATTAAGG 
GTCGTTTTAGGCCACATGCGGTGGCTCATGCCTGTAATCCCAGCACTTTGATAGGCTGAGGCAGGTGGATTGCTT 
GAGCTCGGGAGTTTGAGACCAGCCTCATCAACACAGTGAAACTCCATCTCAATTTAAAAAGAAAAAAAGTGGTTT
TAGGATGTCATTCTTTGCAGTTCTTCATCATGAGACAAGTCTTTTTTTCTGCTTCTTATATTGCAAGCTCCATCT 
CTACTGGTGTGTGCATTTAATGACATCTAACTACAGATGCCGCACAGCCACAATGCTTTGCCTTATAGTTTTTTA 
ACTTTAGAACGGGATTATCTTGTTATTACCTGTATTTTCAGTTTCGGATATTTTTGACTTAATGATGAGATTATC 
AAGACGTAGCCCTATGCTAAGTCATGAGCATATGGACTTACGAGGGTTCGACTTAGAGTTTTGAGCTTTAAGATA 
GGATTATTGGGGCTTACCCCCACCTTAATTAGAGAAACATTTATATTGCTTACTACTGTAGGCTGTACATCTCTT 
TTCCGATTTTTGTATAATGATGTAAACATGGAAAAACTTTAGGAAATGCACTTATTAGGCTGTTTACATGGGTTG 
CCTGGATACAAATCAGCAGTCAAAAATGACTAAAAATATAACTAGTGACGGAGGGAGAAATCCTCCCTCTGTGGG 
AGGCACTTACTGCATTCCAGTTCTCCCTCCTGCGCCCTGAGACTGGACCAGGGTTTGATGGCTGGCAGCTTCTCA 
AGGGGCAGCTTGTCTTACTTGTTAATTTTAGAGGTATATAGCCATATTTATTTATAAATAAATATTTATTTATTT 
ATTTATAAGTAGATGTTTACATATGCCCAGGATTTTGAAGAGCCTGGTATCTTTGGGAAGCCATGTGTCTGGTTT 
GTCGTGCTGGGACAGTCATGGGACTGCATCTTCCGACTTGTCCACAGCAGATGAGGACAGTGAGAATTAAGTTAG 
ATCCGAGACTGCGAAGAGCTTCTCTTTCAAGCGCCATTACAGTTGAACGTTAGTGAATCTTGAGCCTCATTTGGG 
CTCAGGGCAGAGCAGGTGTTTATCTGCCCCGGCATCTGCCATGGCATCAAGAGGGAAGAGTGGACGGTGCTTGGG 
AATGGTGTGAAATGGTTGCCGACTCAGGCATGGATGGGCCCCTCTCGCTTCTGGTGGTCTGTGAACTGAGTCCCT 
GGGATGCCTTTTAGGGCAGAGATTCCTGAGCTGCGTTTTAGGGTACAGATTCCCTGTTTGAGGAGCTTGGCCCCT 
CTGTAAGCATCTGACTCATCTCAGAGATATCAATTCTTAAACACTGTGACAACGGGATCTAAAATGGCTGACACA 
TTTGTCCTTGTGTCACGTTCCATTATTTTATTTAAAAACCTCAGTAATCGTTTTAGCTTCTTTCCAGCAAACTCT 
TCTCCACAGTAGCCCAGTCGTGGTAGGATAAATTACGGATATAGTCATTCTAGGGGTTTCAGTCTTTTCCATCTC 
AAGGCATTGTGTGTTTTGTTCCGGGACTGGTTTGGCTGGGACAAAGTTAGAACTGCCTGAAGTTCGCACATTCAG 
ATTGTTGTGTCCATGGAGTTTTAGGAGGGGATGGCCTTTCCGGTCTTCGCACTTCCATCCTCTCCCACTTCCATC 
TGGCGTCCCACACCTTGTCCCCTGCACTTCTGGATGACACAGGGTGCTGCTGCCTCCTAGTCTTTGCCTTTGCTG 
GGCCTTCTGTGCAGGAGACTTGGTCTCAAAGCTCAGAGAGAGCCAGTCCGGTCCCAGCTCCTTTGTCCCTTCCTC 
AGAGGCGTTCCTTGAAGATGCATCTAGACTACCAGCCTTATCAGTGTTTAAGCTTATTCCTTTAACATAAGCTTC 
CTGACAACATGAAATTGTTGGGGTTTTTTGGCGTTGGTTGATTTGTTTAGGTTTTGCTTTATACCCGGGCCAAAT 
AGCACATAACACCTGGTTATATATGAAATACTCATATGTTTATGACCAAAATAAATATGAAACCTCATRTTAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 340

MGLWGQSVPTASSARAGRYPGARTASGTRPWLLDPKILKFVVFIVAVLLPVRVDSATIPRQDEVPQQTVAPQQQR 
RSLKEEECPAGSHRSEYTGACNPCTEGVDYTIASNNLPSCLLCTVCKSGQTNKSSCTTTRDTVCQCEKGSFQDKN 
SPEMCRTCRTGCPRGMVKVSNCTPRSDIKCKNESAASSTGKTPAAEETVTTILGMLASPYHYLIIIVVLVIILAV 
VVVGFSCRKKFISYLKGICSGGGGGPERVHRVLFRRRSCPSRVPGAEDNARNETLSNRYLQPTQVSEQEIQGQEL 
AELTGVTVESPEEPQRLLEQAEAEGCQRRRLLVPVNDADSADISTLLDASATLEEGHAKETIQDQLVGSEKLFYE 
EDEAGSATSCL 

Important features: 
Transmembrane domains: 

amino acids 35-52, 208-230 

N-glycosylation sites. 

amino acids 127-131, 182-186, 277-281 

Glycosaminoglycan attachment site.

amino acids 245-249

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 260-264 

N-myristoylation sites. 

amino acids 21-27, 86-92, 102-108, 161-167, 242-248, 270-276, 297-303, 380-386 

ATP/GTP-binding site motif A (P-loop).

amino acids 185-193 

TNFR/NGFR cysteine-rich region. 

amino acids 99-139 
FIGURE 341

GCCTCTGAATTGTTGGGCAGTCTGGCAGTGGAGCTCTCCCCGGTCTGACAGCCACTCCAGAGG 
CCATGCTTCGTTTCTTGCCAGATTTGGCTTTCAGCTTCCTGTTAATTCTGGCTTTGGGCCAGG
CAGTCCAATTTCAAGAATATGTCTTTCTCCAATTTCTGGGCTTAGATAAGGCGCCTTCACCCC 
AGAAGTTCCAACCTGTGCCTTATATCTTGAAGAAAATTTTCCAGGATCGCGAGGCAGCAGCGA 
CCACTGGGGTCTCCCGAGACTTATGCTACGTAAAGGAGCTGGGCGTCCGCGGGAATGTACTTC 
GCTTTCTCCCAGACCAAGGTTTCTTTCTTTACCCAAAGAAAATTTCCCAAGCTTCCTCCTGCC 
TGCAGAAGCTCCTCTACTTTAACCTGTCTGCCATCAAAGAAAGGGAACAGTTGACATTGGCCC 
AGCTGGGCCTGGACTTGGGGCCCAATTCTTACTATAACCTGGGACCAGAGCTGGAACTGGCTC 
TGTTCCTGGTTCAGGAGCCTCATGTGTGGGGCCAGACCACCCCTAAGCCAGGTAAAATGTTTG 
TGTTGCGGTCAGTCCCATGGCCACAAGGTGCTGTTCACTTCAACCTGCTGGATGTAGCTAAGG 
ATTGGAATGACAACCCCCGGAAAAATTTCGGGTTATTCCTGGAGATACTGGTCAAAGAAGATA 
GAGACTCAGGGGTGAATTTTCAGCCTGAAGACACCTGTGCCAGACTAAGATGCTCCCTTCATG 
CTTCCCTGCTGGTGGTGACTCTCAACCCTGATCAGTGCCACCCTTCTCGGAAAAGGAGAGCAG 
CCATCCCTGTCCCCAAGCTTTCTTGTAAGAACCTCTGCCACCGTCACCAGCTATTCATTAACT 
TCCGGGACCTGGGTTGGCACAAGTGGATCATTGCCCCCAAGGGGTTCATGGCAAATTACTGCC 
ATGGAGAGTGTCCCTTCTCACTGACCATCTCTCTCAACAGCTCCAATTATGCTTTCATGCAAG 
CCCTGATGCATGCCGTTGACCCAGAGATCCCCCAGGCTGTGTGTATCCCCACCAAGCTGTCTC 
CCATTTCCATGCTCTACCAGGACAATAATGACAATGTCATTCTACGACATTATGAAGACATGG
TAGTCGATGAATGTGGGTGTGGGTAGGATGTCAGAAATGGGAATAGAAGGAGTGTTCTTAGGG 
TAAATCTTTTAATAAAACTACCTATCTGGTTTATGACCACTTAGATCGAAATGTC 
FIGURE 342

MLRFLPDLAFSFLLILALGQAVQFQEYVFLQFLGLDKAPSPQKFQPVPYILKKIFQDREAAAT 
TGVSRDLCYVKELGVRGNVLRFLPDQGFFLYPKKISQASSCLQKLLYFNLSAIKEREQLTLAQ 
LGLDLGPNSYYNLGPELELALFLVQEPHVWGQTTPKPGKMFVLRSVPWPQGAVHFNLLDVAKD 
WNDNPRKNFGLFLEILVKEDRDSGVNFQPEDTCARLRCSLHASLLVVTLNPDQCHPSRKRRAA 
IPVPKLSCKNLCHRHQLFINFRDLGWHKWIIAPKGFMANYCHGECPFSLTISLNSSNYAFMQA 
LMHAVDPEIPQAVCIPTKLSPISMLYQDNNDNVILRHYEDMVVDECGCG

Important features: 
Signal peptide:

amino acids 1-21 

N-glycosylation sites. 

amino acids 112-116, 306-310 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 96-100 

N-myristoylation site. 

amino acids 77-83 

TGF-beta family proteins. 

amino acids 264-299, 327-341, 345-364 
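
The relationship between the nucleotide sequence of Figure 341 and the amino acid sequence of Figure 342 can be spot-checked by translating the start of the open reading frame. The following is a minimal sketch, assuming the standard genetic code and reading the start codon in the second line of Figure 341 as ATG; `translate` and `orf_start` are illustrative names, not part of the original document.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate dna from its first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the Figure 341 open reading frame (start codon
# assumed to be ATG).
orf_start = "ATGCTTCGTTTCTTGCCAGATTTGGCTTTC"

if __name__ == "__main__":
    print(translate(orf_start))
```

The result can be compared with the first ten residues of Figure 342 (MLRFLPDLAF).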
FIGURE 343

CCCACGCGTCCGGCCTTCTCTCTGGACTTTGCATTTCCATTCCTTTTCATTGACAAACTGACTTTTTTTATTTCT 
TTTTTTCCATCTCTGGGCCAGCTTGGGATCCTAGGCCGCCCTGGGAAGACATTTGTGTTTTACACACATAAGGAT 
CTGTGTTTGGGGTTTCTTCTTCCTCCCCTGACATTGGCATTGCTTAGTGGTTGTGTGGGGAGGGAGACCACGTGG 
GCTCAGTGCTTGCTTGCACTTATCTGCCTAGGTACATCGAAGTCTTTTGACCTCCATACAGTGATTATGCCTGTC 
ATCGCTGGTGGTATCCTGGCGGCCTTGCTCCTGCTGATAGTTGTCGTGCTCTGTCTTTACTTCAAAATACACAAC 
GCGCTAAAAGCTGCAAAGGAACCTGAAGCTGTGGCTGTAAAAAATCACAACCCAGACAAGGTGTGGTGGGCCAAG 
AACAGCCAGGCCAAAACCATTGCCACGGAGTCTTGTCCTGCCCTGCAGTGCTGTGAAGGATATAGAATGTGTGCC
AGTTTTGATTCCCTGCCACCTTGCTGTTGCGACATAAATGAGGGCCTCTGAGTTAGGAAAGGCTCCCTTCTCAAA 
GCAGAGCCCTGAAGACTTCAATGATGTCAATGAGGCCACCTGTTTGTGATGTGCAGGCACAGAAGAAAGGCACAG 
CTCCCCATCAGTTTCATGGAAAATAACTCAGTGCCTGCTGGGAACCAGCTGCTGGAGATCCCTACAGAGAGCTTC 
CACTGGGGGCAACCCTTCCAGGAAGGAGTTGGGGAGAGAGAACCCTCACTGTGGGGAATGCTGATAAACCAGTCA 
CACAGCTGCTCTATTCTCACACAAATCTACCCCTTGCGTGGCTGGAACTGACGTTTCCCTGGAGGTGTCCAGAAA 
GCTGATGTAACACAGAGCCTATAAAAGCTGTCGGTCCTTAAGGCTGCCCAGCGCCTTGCCAAAATGGAGCTTGTA 
AGAAGGCTCATGCCATTGACCCTCTTAATTCTCTCCTGTTTGGCGGAGCTGACAATGGCGGAGGCTGAAGGCAAT 
GCAAGCTGCACAGTCAGTCTAGGGGGTGCCAATATGGCAGAGACCCACAAAGCCATGATCCTGCAACTCAATCCC 
AGTGAGAACTGCACCTGGACAATAGAAAGACCAGAAAACAAAAGCATCAGAATTATCTTTTCCTATGTCCAGCTT 
GATCCAGATGGAAGCTGTGAAAGTGAAAACATTAAAGTCTTTGACGGAACCTCCAGCAATGGGCCTCTGCTAGGG 
CAAGTCTGCAGTAAAAACGACTATGTTCCTGTATTTGAATCATCATCCAGTACATTGACGTTTCAAATAGTTACT 
GACTCAGCAAGAATTCAAAGAACTGTCTTTGTCTTCTACTACTTCTTCTCTCCTAACATCTCTATTCCAAACTGT 
GGCGGTTACCTGGATACCTTGGAAGGATCCTTCACCAGCCCCAATTACCCAAAGCCGCATCCTGAGCTGGCTTAT 
TGTGTGTGGCACATACAAGTGGAGAAAGATTACAAGATAAAACTAAACTTCAAAGAGATTTTCCTAGAAATAGAC 
AAACAGTGCAAATTTGATTTTCTTGCCATCTATGATGGCCCCTCCACCAACTCTGGCCTGATTGGACAAGTCTGT
GGCCGTGTGACTCCCACCTTCGAATCGTCATCAAACTCTCTGACTGTCGTGTTGTCTACAGATTATGCCAATTCT 
TACCGGGGATTTTCTGCTTCCTACACCTCAATTTATGCAGAAAACATCAACACTACATCTTTAACTTGCTCTTCT 
GACAGGATGAGAGTTATTATAAGCAAATCCTACCTAGAGGCTTTTAACTCTAATGGGAATAACTTGCAACTAAAA 
GACCCAACTTGCAGACCAAAATTATCAAATGTTGTGGAATTTTCTGTCCCTCTTAATGGATGTGGTACAATCAGA 
AAGGTAGAAGATCAGTCAATTACTTACACCAATATAATCACCTTTTCTGCATCCTCAACTTCTGAAGTGATCACC 
CGTCAGAAACAACTCCAGATTATTGTGAAGTGTGAAATGGGACATAATTCTACAGTGGAGATAATATACATAACA 
GAAGATGATGTAATACAAAGTCAAAATGCACTGGGCAAATATAACACCAGCATGGCTCTTTTTGAATCCAATTCA 
TTTGAAAAGACTATACTTGAATCACCATATTATGTGGATTTGAACCAAACTCTTTTTGTTCAAGTTAGTCTGCAC 
ACCTCAGATCCAAATTTGGTGGTGTTTCTTGATACCTGTAGAGCCTCTCCCACCTCTGACTTTGCATCTCCAACC 
TACGACCTAATCAAGAGTGGATGTAGTCGAGATGAAACTTGTAAGGTGTATCCCTTATTTGGACACTATGGGAGA 
TTCCAGTTTAATGCCTTTAAATTGTTGAGAAGTATGAGCTCTGTGTATCTGCAGTGTAAAGTTTTGATATGTGAT 
AGCAGTGACCACCAGTCTCGCTGCAATCAAGGTTGTGTCTCCAGAAGCAAACGAGACATTTCTTCATATAAATGG 
AAAACAGATTCCATCATAGGACCCATTCGTCTGAAAAGGGATCGAAGTGCAAGTGGCAATTCAGGATTTCAGCAT 
GAAACACATGCGGAAGAAACTCCAAACCAGCCTTTCAACAGTGTGCATCTGTTTTCCTTCATGGTTCTAGCTCTG 
AATGTGGTGACTGTAGCGACAATCACAGTGAGGCATTTTGTAAATCAACGGGCAGACTACAAATACCAGAAGCTG 
CAGAACTATTAACTAACAGGTCCAACCCTAAGTGAGACATGTTTCTCCAGGATGCCAAAGGAAATGCTACCTCGT 
GGCTACACATATTATGAATAAATGAGGAAGGGCCTGAAAGTGACACACAGGCCTGCATGTAAAAAAA 
FIGURE 344

MELVRRLMPLTLLILSCLAELTMAEAEGNASCTVSLGGANMAETHKAMILQLNPSENCTWTIE
RPENKSIRIIFSYVQLDPDGSCESENIKVFDGTSSNGPLLGQVCSKNDYVPVFESSSSTLTFQ 
IVTDSARIQRTVFVFYYFFSPNISIPNCGGYLDTLEGSFTSPNYPKPHPELAYCVWHIQVEKD 
YKIKLNFKEIFLEIDKQCKFDFLAIYDGPSTNSGLIGQVCGRVTPTFESSSNSLTVVLSTDYA 
NSYRGFSASYTSIYAENINTTSLTCSSDRMRVIISKSYLEAFNSNGNNLQLKDPTCRPKLSNV 
VEFSVPLNGCGTIRKVEDQSITYTNIITFSASSTSEVITRQKQLQIIVKCEMGHNSTVEIIYI 
TEDDVIQSQNALGKYNTSMALFESNSFEKTILESPYYVDLNQTLFVQVSLHTSDPNLVVFLDT
CRASPTSDFASPTYDLIKSGCSRDETCKVYPLFGHYGRFQFNAFKFLRSMSSVYLQCKVLICD 
SSDHQSRCNQGCVSRSKRDISSYKWKTDSIIGPIRLKRDRSASGNSGFQHETHAEETPNQPFN
SVHLFSFMVLALNVVTVATITVRHFVNQRADYKYQKLQNY

Important features: 
Signal sequence: 

amino acids 1-24 

Transmembrane domain: 

amino acids 571-586 

N-glycosylation site. 

amino acids 29-33, 57-61, 67-71, 148-152, 271-275, 370-374, 
394-398, 419-423 

Casein kinase II phosphorylation site. 

amino acids 22-26, 108-112, 289-293, 348-352, 371-375, 379-383, 
408-412, 463-467, 520-524, 556-560 

Tyrosine kinase phosphorylation site. 

amino acids 172-180, 407-415, 407-416, 519-528 

N-myristoylation site. 

amino acids 28-34, 38-44, 83-89, 95-101, 104-110, 226-232 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 7-18 
FIGURE 345

TGGGGGCCCCCCAGGCTCGCGCGTGGAGCGAAGCAGCATGGGCAGTCGGTGCGCGCTGGCCCTGGCGGTGCTCTC 

GGCCTTGCTGTGTCAGGTCTGGAGCTCTGGGGTGTTCGAACTGAAGCTGCAGGAGTTCGTCAACAAGAAGGGGCT 

GCTGGGGAACCGCAATTGCTGGCGCGGGGGCGCGGGGCCACCGCCGTGCGCCTGCCGGACCTTCTTCCGCGTGTG 

CCTCAAGCACTACCAGGCCAGCGTGTCCCCCGAGCCGCCCTGCACCTACGGCAGCGCCGTCACCCCCGTGCTGGG 

CGTCGACTCCTTCAGTCTGCCCGACGGCGGGGGCGCCGACTCCGCGTTCAGCAACCCCATCCGCTTCCCCTTCGG

CTTCACCTGGCCGGGCACCTTCTCTCTGATTATTGAAGCTCTCCACACAGATTCTCCTGATGACCTCGCAACAGA 

AAACCCAGAAAGACTCATCAGCCGCCTGGCCACCCAGAGGCACCTGACGGTGGGCGAGGAGTGGTCCCAGGACCT 

GCACAGCAGCGGCCGCACGGACCTCAAGTACTCCTACCGCTTCGTGTGTGACGAACACTACTACGGAGAGGGCTG 

CTCCGTTTTCTGCCGTCCCCGGGACGATGCCTTCGGCCACTTCACCTGTGGGGAGCGTGGGGAGAAAGTGTGCAA 

CCCTGGCTGGAAAGGGCCCTACTGCACAGAGCCGATCTGCCTGCCTGGATGTGATGAGCAGCATGGATTTTGTGA 

CAAACCAGGGGAATGCAAGTGCAGAGTGGGCTGGCAGGGCCGGTACTGTGACGAGTGTATCCGCTATCCAGGCTG 

TCTCCATGGCACCTGCCAGCAGCCCTGGCAGTGCAACTGCCAGGAAGGCTGGGGGGGCCTTTTCTGCAACCAGGA 

CCTGAACTACTGCACACACCATAAGCCCTGCAAGAATGGAGCCACCTGCACCAACACGGGCCAGGGGAGCTACAC 

TTGCTCTTGCCGGCCTGGGTACACAGGTGCCACCTGCGAGCTGGGGATTGACGAGTGTGACCCCAGCCCTTGTAA 

GAACGGAGGGAGCTGCACGGATCTCGAGAACAGCTACTCCTGTACCTGCCCACCCGGCTTCTACGGCAAAATCTG

TGAATTGAGTGCCATGACCTGTGCGGACGGCCCTTGCTTTAACGGGGGTCGGTGCTCAGACAGCCCCGATGGAGG 

GTACAGCTGCCGCTGCCCCGTGGGCTACTCCGGCTTCAACTGTGAGAAGAAAATTGACTACTGCAGCTCTTCACC 

CTGTTCTAATGGTGCCAAGTGTGTGGACCTCGGTGATGCCTACCTGTGCCGCTGCCAGGCCGGCTTCTCGGGGAG 

GCACTGTGACGACAACGTGGACGACTGCGCCTCCTCCCCGTGCGCCAACGGGGGCACCTGCCGGGATGGCGTGAA 

CGACTTCTCCTGCACCTGCCCGCCTGGCTACACGGGCAGGAACTGCAGTGCCCCCGTCAGCAGGTGCGAGCACGC 

ACCCTGCCACAATGGGGCCACCTGCCACGAGAGGGGCCACCGCTATGTGTGCGAGTGTGCCCGAGGCTACGGGGG 

TCCCAACTGCCAGTTCCTGCTCCCCGAGCTGCCCCCGGGCCCAGCGGTGGTGGACCTCACTGAGAAGCTAGAGGG 

CCAGGGCGGGCCATTCCCCTGGGTGGCCGTGTGCGCCGGGGTCATCCTTGTCCTCATGCTGCTGCTGGGCTGTGC 

CGCTGTGGTGGTCTGCGTCCGGCTGAGGCTGCAGAAGCACCGGCCCCCAGCCGACCCCTGCCGGGGGGAGACGGA 

GACCATGAACAACCTGGCCAACTGCCAGCGTGAGAAGGACATCTCAGTCAGCATCATCGGGGCCACGCAGATCAA 

GAACACCAACAAGAAGGCGGACTTCCACGGGGACCACAGCGCCGACAAGAATGGCTTCAAGGCCCGCTACCCAGC 

GGTGGACTATAACCTCGTGCAGGACCTCAAGGGTGACGACACCGCCGTCAGGGACGCGCACAGCAAGCGTGACAC 

CAAGTGCCAGCCCCAGGGCTCCTCAGGGGAGGAGAAGGGGACCCCGACCACACTCAGGGGTGGAGAAGCATCTGA 

AAGAAAAAGGCCGGACTCGGGCTGTTCAACTTCAAAAGACACCAAGTACCAGTCGGTGTACGTCATATCCGAGGA

GAAGGATGAGTGCGTCATAGCAACTGAGGTGTAAAATGGAAGTGAGATGGCAAGACTCCCGTTTCTCTTAAAATA 

AGTAAAATTCCAAGGATATATGCCCCAACGAATGCTGCTGAAGAGGAGGGAGGCCTCGTGGACTGCTGCTGAGAA 

ACCGAGTTCAGACCGAGCAGGTTCTCCTCCTGAGGTCCTCGACGCCTGCCGACAGCCTGTCGCGGCCCGGCCGCC 

TGCGGCACTGCCTTCCGTGACGTCGCCGTTGCACTATGGACAGTTGCTCTTAAGAGAATATATATTTAAATGGGT 

GAACTGAATTACGCATAAGAAGCATGCACTGCCTGAGTGTATATTTTGGATTCTTATGAGCCAGTCTTTTCTTGA 

ATTAGAAACACAAACACTGCCTTTATTGTCCTTTTTGATACGAAGATGTGCTTTTTCTAGATGGAAAAGATGTGT

GTTATTTTTTGGATTTGTAAAAATATTTTTCATGATATCTGTAAAGCTTGAGTATTTTGTGATGTTCGTTTTTTA 

TAATTTAAATTTTGGTAAATATGTACAAAGGCACTTCGGGTCTATGTGACTATATTTTTTTGTATATAAATGTAT 

TTATGGAATATTGTGCAAATGTTATTTGAGTTTTTTACTGTTTTGTTAATGAAGAAATTCCTTTTTAAAATATTT 

TTCCAAAATAAATTTTATGAATGACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAA 
FIGURE 346

MGSRCALALAVLSALLCQVWSSGVFELKLQEFVNKKGLLGNRNCCRGGAGPPPCACRTFFRVC 
LKHYQASVSPEPPCTYGSAVTPVLGVDSFSLPDGGGADSAFSNPIRFPFGFTWPGTFSLIIEA 
LHTDSPDDLATENPERLISRLATQRHLTVGEEWSQDLHSSGRTDLKYSYRFVCDEHYYGEGCS 
VFCRPRDDAFGHFTCGERGEKVCNPGWKGPYCTEPICLPGCDEQHGFCDKPGECKCRVGWQGR 
YCDECIRYPGCLHGTCQQPWQCNCQEGWGGLFCNQDLNYCTHHKPCKNGATCTNTGQGSYTCS 
CRPGYTGATCELGIDECDPSPCKNGGSCTDLENSYSCTCPPGFYGKICELSAMTCADGPCFNG 
GRCSDSPDGGYSCRCPVGYSGFNCEKKIDYCSSSPCSNGAKCVDLGDAYLCRCQAGFSGRHCD 
DNVDDCASSPCANGGTCRDGVNDFSCTCPPGYTGRNCSAPVSRCEHAPCHNGATCHERGHRYV 
CECARGYGGPNCQFLLPELPPGPAVVDLTEKLEGQGGPFPWVAVCAGVILVLMLLLGCAAVVV 
CVRLRLQKHRPPADPCRGETETMNNLANCQREKDISVSIIGATQIKNTNKKADFHGDHSADKN 
GFKARYPAVDYNLVQDLKGDDTAVRDAHSKRDTKCQPQGSSGEEKGTPTTLRGGEASERKRPD 
SGCSTSKDTKYQSVYVISEEKDECVIATEV 

Important features: 
Signal sequence: 

Amino acids 1-21 

Transmembrane domain: 

Amino acids 546-566 

N-glycosylation site: 

Amino acids 477-481 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

Amino acids 660-664 

Tyrosine kinase phosphorylation sites: 

Amino acids 176-185; 252-261 

N-myristoylation sites: 

Amino acids 2-8; 37-43; 40-46; 98-104; 99-105; 262-268; 281-287;
282-288; 301-307; 310-316; 328-334; 340-344; 378-384; 387-393; 512-518;
676-682; 683-689; 695-701

Aspartic acid and asparagine hydroxylation sites: 

Amino acids 343-355; 420-432; 458-470 

Prokaryotic membrane lipoprotein lipid attachment site: 

Amino acids 552-563 

EGF-like domain cysteine pattern signature: 

Amino acids 243-255; 274-286; 314-326; 352-364; 391-403; 429-441;
467-479; 505-517
FIGURE 347

CCCACGCGTCCGCACCTCGGCCCCGGGCTCCGAAGCGGCTCGGGGGCGCCCTTTCGGTCAACA 
TCGTAGTCCACCCCCTCCCCATCCCCAGCCCCCGGGGATTCAGGCTCGCCAGCGCCCAGCCAG 
GGAGCCGGCCGGGAAGCGCGATGGGGGCCCCAGCCGCCTCGCTCCTGCTCCTGCTCCTGCTGT
TCGCCTGCTGCTGGGCGCCCGGCGGGGCCAACCTCTCCCAGGACGACAGCCAGCCCTGGACAT 
CTGATGAAACAGTGGTGGCTGGTGGCACCGTGGTGCTCAAGTGCCAAGTGAAAGATCACGAGG 
ACTCATCCCTGCAATGGTCTAACCCTGCTCAGCAGACTCTCTACTTTGGGGAGAAGAGAGCCC 
TTCGAGATAATCGAATTCAGCTGGTTACCTCTACGCCCCACGAGCTCAGCATCAGCATCAGCA
ATGTGGCCCTGGCAGACGAGGGCGAGTACACCTGCTCAATCTTCACTATGCCTGTGCGAACTG 
CCAAGTCCCTCGTCACTGTGCTAGGAATTCCACAGAAGCCCATCATCACTGGTTATAAATCTT 
CATTACGGGAAAAAGACACAGCCACCCTAAACTGTCAGTCTTCTGGGAGCAAGCCTGCAGCCC 
GGCTCACCTGGAGAAAGGGTGACCAAGAACTCCACGGAGAACCAACCCGCATACAGGAAGATC 
CCAATGGTAAAACCTTCACTGTCAGCAGCTCGGTGACATTCCAGGTTACCCGGGAGGATGATG 
GGGCGAGCATCGTGTGCTCTGTGAACCATGAATCTCTAAAGGGAGCTGACAGATCCACCTCTC 
AACGCATTGAAGTTTTATACACACCAACTGCGATGATTAGGCCAGACCCTCCCCATCCTCGTG
AGGGCCAGAAGCTGTTGCTACACTGTGAGGGTCGCGGCAATCCAGTCCCCCAGCAGTACCTAT 
GGGAGAAGGAGGGCAGTGTGCCACCCCTGAAGATGACCCAGGAGAGTGCCCTGATCTTCCCTT 
TCCTCAACAAGAGTGACAGTGGCACCTACGGCTGCACAGCCACCAGCAACATGGGCAGCTACA 
AGGCCTACTACACCCTCAATGTTAATGACCCCAGTCCGGTGCCCTCCTCCTCCAGCACCTACC 
ACGCCATCATCGGTGGGATCGTGGCTTTCATTGTCTTCCTGCTGCTCATCATGCTCATCTTCC 
TTGGCCACTACTTGATCCGGCACAAAGGAACCTACCTGACACATGAGGCAAAAGGCTCCGACG 
ATGCTCCAGACGCGGACACGGCCATCATCAATGCAGAAGGCGGGCAGTCAGGAGGGGACGACA 
AGAAGGAATATTTCATCTAGAGGCGCCTGCCCACTTCCTGCGCCCCCCAGGGGCCCTGTGGGG 
ACTGCTGGGGCCGTCACCAACCCGGACTTGTACAGAGCAACCGCAGGGCCGCCCCTCCCGCTT 
GCTCCCCAGCCCACCCACCCCCCTGTACAGAATGTCTGCTTTGGGTGCGGTTTTGTACTCGGT 
TTGGAATGGGGAGGGAGGAGGGCGGGGGGAGGGGAGGGTTGCCCTCAGCCCTTTCCGTGGCTT 
CTCTGCATTTGGGTTATTATTATTTTTGTAACAATCCCAAATCAAATCTGTCTCCAGGCTGGA 
GAGGCAGGAGCCCTGGGGTGAGAAAAGCAAAAAACAAACAAAAAACA 
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MGAPAASLLLLLLLFACCWAPGGANLSQDDSQPWTSDETVVAGGTVVLKCQVKDHEDSSLQWS
NPAQQTLYFGEKRALRDNRIQLVTSTPHELSISISNVALADEGEYTCSIFTMPVRTAKSLVTV 
LGIPQKPIITGYKSSLREKDTATLNCQSSGSKPAARLTWRKGDQELHGEPTRIQEDPNGKTFT 
VSSSVTFQVTREDDGASIVCSVNHESLKGADRSTSQRIEVLYTPTAMIRPDPPHPREGQKLLL 
HCEGRGNPVPQQYLWEKEGSVPPLKMTQESALIFPFLNKSDSGTYGCTATSNMGSYKAYYTLN 
VNDPSPVPSSSSTYHAIIGGIVAFIVFLLLIMLIFLGHYLIRHKGTYLTHEAKGSDDAPDADT 
AIINAEGGQSGGDDKKEYFI 

Important features: 
Signal sequence: 

amino acids 1-20 

Transmembrane domain: 

amino acids 331-352 

N-glycosylation site. 

amino acids 25-29, 290-294 

Casein kinase II phosphorylation site. 

amino acids 27-31, 35-39, 89-93, 141-145, 199-203, 388-392 
N-myristoylation site. 

amino acids 2-8, 23-29, 156-162, 218-224, 295-301, 298-304, 
306-310, 334-340, 360-364, 385-389, 386-390 

Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 7-18 

WO 01/40466
PCT/US00/32678

FIGURE 349 

ACTTGCCATCACCTGTTGCCAGTGTGGAAAAATTCTCCCTGTTGAATTTTTTGCACATGGAGGACAGCAGCAAAG 
AGGGCAACACAGGCTGATAAGACCAGAGACAGCAGGGAGATTATTTTACCATACGCCCTCAGGACGTTCCCTCTA 
GCTGGAGTTCTGGACTTCAACAGAACCCCATCCAGTCATTTTGATTTTGCTGTTTATTTTTTTTTTCTTTTTCTT 
TTTCCCACCACATTGTATTTTATTTCCGTACTTCAGAAATGGGCCTACAGACCACAAAGTGGCCCAGCCATGGGG 
CTTTTTTCCTGAAGTCTTGGCTTATCATTTCCCTGGGGCTCTACTCACAGGTGTCCAAACTCCTGGCCTGCCCTA 
GTGTGTGCCGCTGCGACAGGAACTTTGTCTACTGTAATGAGCGAAGCTTGACCTCAGTGCCTCTTGGGATCCCGG 
AGGGCGTAACCGTACTCTACCTCCACAACAACCAAATTAATAATGCTGGATTTCCTGCAGAACTGCACAATGTAC 
AGTCGGTGCACACGGTCTACCTGTATGGCAACCAACTGGACGAATTCCCCATGAACCTTCCCAAGAATGTCAGAG 
TTCTCCATTTGCAGGAAAACAATATTCAGACCATTTCACGGGCTGCTCTTGCCCAGCTCTTGAAGCTTGAAGAGC 
TGCACCTGGATGACAACTCCATATCCACAGTGGGGGTGGAAGACGGGGCCTTCCGGGAGGCTATTAGCCTCAAAT 
TGTTGTTTTTGTCTAAGAATCACCTGAGCAGTGTGCCTGTTGGGCTTCCTGTGGACTTGCAAGAGCTGAGAGTGG 
ATGAAAATCGAATTGCTGTCATATCCGACATGGCCTTCCAGAATCTCACGAGCTTGGAGCGTCTTATTGTGGACG 
GGAACCTCCTGACCAACAAGGGTATCGCCGAGGGCACCTTCAGCCATCTCACCAAGCTCAAGGAATTTTCAATTG 
TACGTAATTCGCTGTCCGACCCTCCTCCCGATCTCCCAGGTACGCATCTGATCAGGCTCTATTTGCAGGACAACC 
AGATAAACCACATTCCTTTGACAGCCTTCTCAAATCTGCGTAAGCTGGAACGGCTGGATATATCCAACAACCAAC 
TGCGGATGCTGACTCAAGGGGTTTTTGATAATCTCTCCAACCTGAAGCAGCTCACTGCTCGGAATAACCCTTGGT 
TTTGTGACTGCAGTATTAAATGGGTCACAGAATGGCTCAAATATATCCCTTCATCTCTCAACGTGCGGGGTTTCA 
TGTGCCAAGGTCCTGAACAAGTCCGGGGGATGGCCGTCAGGGAATTAAATATGAATCTTTTGTCCTGTCCCACCA 
CGACCCCCGGCCTGCCTCTCTTCACCCCAGCCCCAAGTACAGCTTCTCCGACCACTCAGCCTCCCACCCTCTCTA 
TTCCAAACCCTAGCAGAAGCTACACGCCTCCAACTCCTACCACATCGAAACTTCCCACGATTCCTGACTGGGATG 
GCAGAGAAAGAGTGACCCCACCTATTTCTGAACGGATCCAGCTCTCTATCCATTTTGTGAATGATACTTCCATTC 
AAGTCAGCTGGCTCTCTCTCTTCACCGTGATGGCATACAAACTCACATGGGTGAAAATGGGCCACAGTTTAGTAG 
GGGGCATCGTTCAGGAGCGCATAGTCAGCGGTGAGAAGCAACACCTGAGCCTGGTTAACTTAGAGCCCCGATCCA 
CCTATCGGATTTGTTTAGTGCCACTGGATGCTTTTAACTACCGCGCGGTAGAAGACACCATTTGTTCAGAGGCCA 
CCACCCATGCCTCCTATCTGAACAACGGCAGCAACACAGCGTCCAGCCATGAGCAGACGACGTCCCACAGCATGG 
GCTCCCCCTTTCTGCTGGCGGGCTTGATCGGGGGCGCGGTGATATTTGTGCTGGTGGTCTTGCTCAGCGTCTTTT 
GCTGGCATATGCACAAAAAGGGGCGCTACACCTCCCAGAAGTGGAAATACAACCGGGGCCGGCGGAAAGATGATT 
ATTGCGAGGCAGGCACCAAGAAGGACAACTCCATCCTGGAGATGACAGAAACCAGTTTTCAGATCGTCTCCTTAA 
ATAACGATCAACTCCTTAAAGGAGATTTCAGACTGCAGCCCATTTACACCCCAAATGGGGGCATTAATTACACAG 
ACTGCCATATCCCCAACAACATGCGATACTGCAACAGCAGCGTGCCAGACCTGGAGCACTGCCATACGTGACAGC 
CAGAGGCCCAGCGTTATCAAGGCGGACAATTAGACTCTTGAGAACACACTCGTGTGTGCACATAAAGACACGCAG 
ATTACATTTGATAAATGTTACACAGATGCATTTGTGCATTTGAATACTCTGTAATTTATACGGTGTACTATATAA 
TGGGATTTAAAAAAAGTGCTATCTTTTCTATTTCAAGTTAATTACAAACAGTTTTGTAACTCTTTGCTTTTTAAA 
TCTT 
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MGLQTTKWPSHGAFFLKSWLIISLGLYSQVSKLLACPSVCRCDRNFVYCNERSLTSVPLGIPE 
GVTVLYLHNNQINNAGFPAELHNVQSVHTVYLYGNQLDEFPMNLPKNVRVLHLQENNIQTISR 
AALAQLLKLEELHLDDNSISTVGVEDGAFREAISLKLLFLSKNHLSSVPVGLPVDLQELRVDE 
NRIAVISDMAFQNLTSLERLIVDGNLLTNKGIAEGTFSHLTKLKEFSIVRNSLSHPPPDLPGT 
HLIRLYLQDNQINHIPLTAFSNLRKLERLDISNNQLRMLTQGVFDNLSNLKQLTARNNPWFCD 
CSIKWVTEWLKYIPSSLNVRGFMCQGPEQVRGMAVRELNMNLLSCPTTTPGLPLFTPAPSTAS 
PTTQPPTLSIPNPSRSYTPPTPTTSKLPTIPDWDGRERVTPPISERIQLSIHFVNDTSIQVSW 
LSLFTVMAYKLTWVKMGHSLVGGIVQERIVSGEKQHLSLVNLEPRSTYRICLVPLDAFNYRAV
EDTICSEATTHASYLNNGSNTASSHEQTTSHSMGSPFLLAGLIGGAVIFVLVVLLSVFCWHMH
KKGRYTSQKWKYNRGRRKDDYCEAGTKKDNSILEMTETSFQIVSLNNDQLLKGDFRLQPIYTP 
NGGINYTDCHIPNNMRYCNSSVPDLEHCHT 

Important features: 
Signal peptide: 

amino acids 1-42 
Transmembrane domain: 
amino acids 542-561 
N-glycosylation site. 

amino acids 202-206, 298-302, 433-437, 521-525, 635-639, 649-653
Casein kinase II phosphorylation site. 

amino acids 204-208, 407-411, 527-531, 593-597, 598-602, 651-655 
Tyrosine kinase phosphorylation site.
amino acids 319-328 
N-myristoylation site. 

amino acids 2-8, 60-66, 149-155, 213-219, 220-226, 294-300,
522-528, 545-551, 633-639

Amidation site. 

amino acids 581-585 

Leucine zipper pattern. 

amino acids 164-186 

Phospholipase A2 aspartic acid active site. 

amino acids 39-50 
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AGCCGACGCTGCTCAAGCTGCAACTCTGTTGCAGTTGGCAGTTCTTTTCGGTTTCCCTCCTGCTGTTTGGGGGCA 
TGAAAGGGCTTCGCCGCCGGGAGTAAAAGAAGGAATTGACCGGGCAGCGCGAGGGAGGAGCGCGCACGCGACCGC 
GAGGGCGGGCGTGCACCCTCGGCTGGAAGTTTGTGCCGGGCCCCGAGCGCGCGCCGGCTGGGAGCTTCGGGTAGA 
GACCTAGGCCGCTGGACCGCGATGAGCGCGCCGAGCCTCCGTGCGCGCGCCGCGGGGTTGGGGCTGCTGCTGTGC 
GCGGTGCTGGGGCGCGCTGGCCGGTCCGACAGCGGCGGTCGCGGGGAACTCGGGCAGCCCTCTGGGGTAGCCGCC 
GAGCGCCCATGCCCCACTACCTGCCGCTGCCTCGGGGACCTGCTGGACTGCAGTCGTAAGGGGCTAGCGCGTCTT 
CCCGAGCCACTCCCGTCCTGGGTCGCTCGGCTGGACTTAAGTCACAACAGATTATCTTTCATCAAGGCAAGTTCC 
ATGAGCCACCTTCAAAGCCTTCGAGAAGTGAAACTGAACAACAATGAATTGGAGACCATTCCAAATCTGGGACCA 
GTCTCGGCAAATATTACACTTCTCTCCTTGGCTGGAAACAGGATTGTTGAAATACTCCCTGAACATCTGAAAGAG 
TTTCAGTCCCTTGAAACTTTGGACCTTAGCAGCAACAATATTTCAGAGCTCCAAACTGCATTTCCAGCCCTACAG 
CTCAAATATCTGTATCTCAACAGCAACCGAGTCACATCAATGGAACCTGGGTATTTTGACAATTTGGCCAACACA 
CTCCTTGTGTTAAAGCTGAACAGGAACCGAATCTCAGCTATCCCACCCAAGATGTTTAAACTGCCCCAACTGCAA 
CATCTCGAATTGAACCGAAACAAGATTAAAAATGTAGATGGACTGACATTCCAAGGCCTTGGTGCTCTGAAGTCT 
CTGAAAATGCAAAGAAATGGAGTAACGAAACTTATGGATGGAGCTTTTTGGGGGCTGAGCAACATGGAAATTTTG 
CAGCTGGACCATAACAACCTAACAGAGATTACCAAAGGCTGGCTTTACGGCTTGCTGATGCTGCAGGAACTTCAT 
CTCAGCCAAAATGCCATCAACAGGATCAGCCCTGATGCCTGGGAGTTCTGCCAGAAGCTCAGTGAGCTGGACCTA 
ACTTTCAATCACTTATCAAGGTTAGATGATTCAAGCTTCCTTGGCCTAAGCTTACTAAATACACTGCACATTGGG 
AACAACAGAGTCAGCTACATTGCTGATTGTGCCTTCCGGGGGCTTTCCAGTTTAAAGACTTTGGATCTGAAGAAC 
AATGAAATTTCCTGGACTATTGAAGACATGAATGGTGCTTTCTCTGGGCTTGACAAACTGAGGCGACTGATACTC 
CAAGGAAATCGGATCCGTTCTATTACTAAAAAAGCCTTCACTGGTTTGGATGCATTGGAGCATGTAGACCTGAGT 
GACAACGCAATCATGTCTTTACAAGGCAATGCATTTTCACAAATGAAGAAACTGCAACAATTGCATTTAAATACA 
TCAAGCCTTTTGTGCGATTGCCAGCTAAAATGGCTCCCACAGTGGGTGGCGGAAAACAACTTTCAGAGCTTTGTA 
AATGCCAGTTGTGCCCATCCTCAGCTGCTAAAAGGAAGAAGCATTTTTGCTGTTAGCCCAGATGGCTTTGTGTGT 
GATGATTTTCCCAAACCCCAGATCACGGTTCAGCCAGAAACACAGTCGGCAATAAAAGGTTCCAATTTGAGTTTC 
ATCTGCTCAGCTGCCAGCAGCAGTGATTCCCCAATGACTTTTGCTTGGAAAAAAGACAATGAACTACTGCATGAT 
GCTGAAATGGAAAATTATGCACACCTCCGGGCCCAAGGTGGCGAGGTGATGGAGTATACCACCATCCTTCGGCTG 
CGCGAGGTGGAATTTGCCAGTGAGGGGAAATATCAGTGTGTCATCTCCAATCACTTTGGTTCATCCTACTCTGTC
AAAGCCAAGCTTACAGTAAATATGCTTCCCTCATTCACCAAGACCCCCATGGATCTCACCATCCGAGCTGGGGCC 
ATGGCACGCTTGGAGTGTGCTGCTGTGGGGCACCCAGCCCCCCAGATAGCCTGGCAGAAGGATGGGGGCACAGAC 
TTCCCAGCTGCACGGGAGAGACGCATGCATGTGATGCCCGAGGATGACGTGTTCTTTATCGTGGATGTGAAGATA 
GAGGACATTGGGGTATACAGCTGCACAGCTCAGAACAGTGCAGGAAGTATTTCAGCAAATGCAACTCTGACTGTC 
CTAGAAACACCATCATTTTTGCGGCCACTGTTGGACCGAACTGTAACCAAGGGAGAAACAGCCGTCCTACAGTGC 
ATTGCTGGAGGAAGCCCTCCCCCTAAACTGAACTGGACCAAAGATGATAGCCCATTGGTGGTAACCGAGAGGCAC 
TTTTTTGCAGCAGGCAATCAGCTTCTGATTATTGTGGACTCAGATGTCAGTGATGCTGGGAAATACACATGTGAG 
ATGTCTAACACCCTTGGCACTGAGAGAGGAAACGTGCGCCTCAGTGTGATCCCCACTCCAACCTGCGACTCCCCT 
CAGATGACAGCCCCATCGTTAGACGATGACGGATGGGCCACTGTGGGTGTCGTGATCATAGCCGTGGTTTGCTGT 
GTGGTGGGCACGTCACTCGTGTGGGTGGTCATCATATACCACACAAGGCGGAGGAATGAAGATTGCAGCATTACC 
AACACAGATGAGACCAACTTGCCAGCAGATATTCCTAGTTATTTGTCATCTCAGGGAACGTTAGCTGACAGGCAG 
GATGGGTACGTGTCTTCAGAAAGTGGAAGCCACCACCAGTTTGTCACATCTTCAGGTGCTGGATTTTTCTTACCA 
CAACATGACAGTAGTGGGACCTGCCATATTGACAATAGCAGTGAAGCTGATGTGGAAGCTGCCACAGATCTGTTC 
CTTTGTCCGTTTTTGGGATCCACAGGCCCTATGTATTTGAAGGGAAATGTGTATGGCTCAGATCCTTTTGAAACA 
TATCATACAGGTTGCAGTCCTGACCCAAGAACAGTTTTAATGGACCACTATGAGCCCAGTTACATAAAGAAAAAG 
GAGTGCTACCCATGTTCTCATCCTTGAGAAGAATCCTGCGAACGGAGCTTCAGTAATATATCGTGGCCTTCACAT 
GTGAGGAAGCTACTTAACACTAGTTACTCTCACAATGAAGGACCTGGAATGAAAAATCTGTGTCTAAACAAGTCC
TCTTTAGATTTTAGTGCAAATCCAGAGCCAGCGTCGGTTGCCTCGAGTAATTCTTTCATGGGTACCTTTGGAAAA
GCTCTCAGGAGACCTCACCTAGATGCCTATTCAAGCTTTGGACAGCCATCAGATTGTCAGCCAAGAGCCTTTTAT 
TTGAAAGCTCATTCTTCCCCAGACTTGGACTCTGGGTCAGAGGAAGATGGGAAAGAAAGGACAGATTTTCAGGAA 
GAAAATCACATTTGTACCTTTAAACAGACTTTAGAAAACTACAGGACTCCAAATTTTCAGTCTTATGACTTGGAC 
ACATAGACTGAATGAGACCAAAGGAAAAGCTTAACATACTACCTCAAGTGAACTTTTATTTAAAAGAGAGAGAAT
CTTATGTTTTTTAAATGGAGTTATGAATTTTAAAAGGATAAAAATGCTTTATTTATACAGATGAACCAAAATTAC 
AAAAAGTTATGAAAATTTTTATACTGGGAATGATGCTCATATAAGAATACCTTTTTAAACTATTTTTTAACTTTG 
TTTTATGCAAAAAAGTATCTTACGTAAATTAATGATATAAATCATGATTATTTTATGTATTTTTATAATGCCAGA 
TTTCTTTTTATGGAAAATGAGTTACTAAAGCATTTTAAATAATACCTGCCTTGTACCATTTTTTAAATAGAAGTT 
ACTTCATTATATTTTGCACATTATATTTAATAAAATGTGTCAATTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MSAPSLRARAAGLGLLLCAVLGRAGRSDSGGRGELGQPSGVAAERPCPTTCRCLGDLLDCSRKRLARLPEPLPSW 
VARLDLSHNRLSFIKASSMSHLQSLREVKLNNNELETIPNLGPVSANITLLSLAGNRIVEILPEHLKEFQSLETL 
DLSSNNISELQTAFPALQLKYLYLNSNRVTSMEPGYFDNLANTLLVLKLNRNRISAIPPKMFKLPQLQHLELNRN 
KIKNVDGLTFQGLGALKSLKMQRNGVTKLMDGAFWGLSNMEILQLDHNNLTEITKGWLYGLLMLQELHLSQNAIN 
RISPDAWEFCQKLSELDLTFNHLSRLDDSSFLGLSLLNTLHIGNNRVSYIADCAFRGLSSLKTLDLKNNEISWTI 
EDMNGAFSGLDKLRRLILQGNRIRSITKKAFTGLDALEHLDLSDNAIMSLQGNAFSQMKKLQQLHLNTSSLLCDC 
QLKWLPQWVAENNFQSFVNASCAHPQLLKGRSIFAVSPDGFVCDDFPKPQITVQPETQSAIKGSNLSFICSAASS 
SDSPMTFAWKKDNELLHDAEMENYAHLRAQGGEVMEYTTILRLREVEFASEGKYQCVISNHFGSSYSVKAKLTVN 
MLPSFTKTPMDLTIRAGAMARLECAAVGHPAPQIAWQKDGGTDFPAARERRMHVMPEDDVFFIVDVKIEDIGVYS
CTAQNSAGSISANATLTVLETPSFLRPLLDRTVTKGETAVLQCIAGGSPPPKLNWTKDDSPLVVTERHFFAAGNQ 
LLIIVDSDVSDAGKYTCEMSNTLGTERGNVRLSVIPTPTCDSPQMTAPSLDDDGWATVGVVIIAVVCCVVGTSLV
WVVIIYHTRRRNEDCSITNTDETNLPADIPSYLSSQGTLADRQDGYVSSESGSHHQFVTSSGAGFFLPQHDSSGT
CHIDNSSEADVEAATDLFLCPFLGSTGPMYLKGNVYGSDPFETYHTGCSPDPRTVLMDHYEPSYIKKKECYPCSH 
PSEESCERSFSNISWPSHVRKLLNTSYSHNEGPGMKNLCLNKSSLDFSANPEPASVASSNSFMGTFGKALRRPHL 
DAYSSFGQPSDCQPRAFYLKAHSSPDLDSGSEEDGKERTDFQEENHICTFKQTLENYRTPNFQSYDLDT 

Important features:
Signal sequence:

amino acids 1-27 

Transmembrane domain: 

amino acids 808-828 

N-glycosylation site. 

amino acids 122-126, 156-160, 274-278, 442-446, 469-473, 515-519, 
688-692, 729-733, 905-909, 987-991, 999-1003, 1016-1020 

Glycosaminoglycan attachment site. 

amino acids 886-890 

Casein kinase II phosphorylation site. 

amino acids 99-103, 180-184, 263-267, 314-318, 324-328, 374-378,
383-387, 407-411, 524-528, 608-612, 692-696, 709-713, 731-735,
799-803, 843-847, 863-867, 907-911, 1003-1007, 1018-1022, 
1073-1077, 1079-1083, 1081-1085 

Tyrosine kinase phosphorylation site. 

amino acids 667-675 

N-myristoylation site.

amino acids 14-20, 36-42, 239-245, 257-263, 380-386, 427-433, 
513-519, 588-594, 672-678, 683-687, 774-780, 933-939

Leucine zipper pattern. 

amino acids 58-80, 65-87 
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GGGGGTTAGGGAGGAAGGAATCGACCCCCACCCCCCCAAACCCTTTTCTTCTCCTTTCCTGGCTTCGGACATTGG 
AGCACTAAATGAACTTGAATTGTGTCTGTGGCGAGCAGGATGGTCGCTGTTACTTTGTGATGAGATCGGGGATGA 
ATTGCTCGCTTTAAAAATGCTGCTTTGGATTCTGTTGCTGGAGACGTCTCTTTGTTTTGCCGCTGGAAACGTTAC 
AGGGGACGTTTGCAAAGAGAAGATCTGTTCCTGCAATGAGATAGAAGGGGACCTACACGTAGACTGTGAAAAAAA 
GGGCTTCACAAGTCTGCAGCGTTTCACTGCCCCGACTTCCCAGTTTTACCATTTATTTCTGCATGGCAATTCCCT
CACTCGACTTTTCCCTAATGAGTTCGCTAACTTTTATAATGCGGTTAGTTTGCACATGGAAAACAATGGCTTGCA 
TGAAATCGTTCCGGGGGCTTTTCTGGGGCTGCAGCTGGTGAAAAGGCTGCACATCAACAACAACAAGATCAAGTC 
TTTTCGAAAGCAGACTTTTCTGGGGCTGGACGATCTGGAATATCTCCAGGCTGATTTTAATTTATTACGAGATAT 
AGACCCGGGGGCCTTCCAGGACTTGAACAAGCTGGAGGTGCTCATTTTAAATGACAATCTCATCAGCACCCTACC 
TGCCAACGTGTTCCAGTATGTGCCCATCACCCACCTCGACCTCCGGGGTAACAGGCTGAAAACGCTGCCCTATGA 
GGAGGTCTTGGAGCAAATCCCTGGTATTGCGGAGATCCTGCTAGAGGATAACCCTTGGGACTGCACCTGTGATCT 
GCTCTCCCTGAAAGAATGGCTGGAAAACATTCCCAAGAATGCCCTGATCGGCCGAGTGGTCTGCGAAGCCCCCAC 
CAGACTGCAGGGTAAAGACCTCAATGAAACCACCGAACAGGACTTGTGTCCTTTGAAAAACCGAGTGGATTCTAG 
TCTCCCGGCGCCCCCTGCCCAAGAAGAGACCTTTGCTCCTGGACCCCTGCCAACTCCTTTCAAGACAAATGGGCA 
AGAGGATCATGCCACACCAGGGTCTGCTCCAAACGGAGGTACAAAGATCCCAGGCAACTGGCAGATCAAAATCAG 
ACCCACAGCAGCGATAGCGACGGGTAGCTCCAGGAACAAACCCTTAGCTAACAGTTTACCCTGCCCTGGGGGCTG 
CAGCTGCGACCACATCCCAGGGTCGGGTTTAAAGATGAACTGCAACAACAGGAACGTGAGCAGCTTGGCTGATTT 
GAAGCCCAAGCTCTCTAACGTGCAGGAGCTTTTCCTACGAGATAACAAGATCCACAGCATCCGAAAATCGCACTT 
TGTGGATTACAAGAACCTCATTCTGTTGGATCTGGGCAACAATAACATCGCTACTGTAGAGAACAACACTTTCAA 
GAACCTTTTGGACCTCAGGTGGCTATACATGGATAGCAATTACCTGGACACGCTGTCCCGGGAGAAATTCGCGGG 
GCTGCAAAACCTAGAGTACCTGAACGTGGAGTACAACGCTATCCAGCTCATCCTCCCGGGCACTTTCAATGCCAT 
GCCCAAACTGAGGATCCTCATTCTCAACAACAACCTGCTGAGGTCCCTGCCTGTGGACGTGTTCGCTGGGGTCTC 
GCTCTCTAAACTCAGCCTGCACAACAATTACTTCATGTACCTCCCGGTGGCAGGGGTGCTGGACCAGTTAACCTC 
CATCATCCAGATAGACCTCCACGGAAACCCCTGGGAGTGCTCCTGCACAATTGTGCCTTTCAAGCAGTGGGCAGA 
ACGCTTGGGTTCCGAAGTGCTGATGAGCGACCTCAAGTGTGAGACGCCGGTGAACTTCTTTAGAAAGGATTTCAT 
GCTCCTCTCCAATGACGAGATCTGCCCTCAGCTGTACGCTAGGATCTCGCCCACGTTAACTTCGCACAGTAAAAA 
CAGCACTGGGTTGGCGGAGACCGGGACGCACTCCAACTCCTACCTAGACACCAGCAGGGTGTCCATCTCGGTGTT 
GGTCCCGGGACTGCTGCTGGTGTTTGTCACCTCCGCCTTCACCGTGGTGGGCATGCTCGTGTTTATCCTGAGGAA 
CCGAAAGCGGTCCAAGAGACGAGATGCCAACTCCTCCGCGTCCGAGATTAATTCCCTACAGACAGTCTGTGACTC 
TTCCTACTGGCACAATGGGCCTTACAACGCAGATGGGGCCCACAGAGTGTATGACTGTGGCTCTCACTCGCTCTC 
AGACTAAGACCCCAACCCCAATAGGGGAGGGCAGAGGGAAGGCGATACATCCTTCCCCACCGCAGGCACCCCGGG 
GGCTGGAGGGGCGTGTACCCAAATCCCCGCGCCATCAGCCTGGATGGGCATAAGTAGATAAATAACTGTGAGCTC 
GCACAACCGAAAGGGCCTGACCCCTTACTTAGCTCCCTCCTTGAAACAAAGAGCAGACTGTGGAGAGCTGGGAGA 
GCGCAGCCAGCTCGCTCTTTGCTGAGAGCCCCTTTTGACAGAAAGCCCAGCACGACCCTGCTGGAAGAACTGACA 
GTGCCCTCGCCCTCGGCCCCGGGGCCTGTGGGGTTGGATGCCGCGGTTCTATACATATATACATATATCCACATC 
TATATAGAGAGATAGATATCTATTTTTCCCCTGTGGATTAGCCCCGTGATGGCTCCCTGTTGGCTACGCAGGGAT 
GGGCAGTTGCACGAAGGCATGAATGTATTGTAAATAAGTAACTTTGACTTCTGAC 
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FIGURE 354 

MLLWILLLETSLCFAAGNVTGDVCKEKICSCNEIEGDLHVDCEKKGFTSLQRFTAPTSQFYHL 
FLHGNSLTRLFPNEFANFYNAVSLHMENNGLHEIVPGAFLGLQLVKRLHINNNKIKSFRKQTF 
LGLDDLEYLQADFNLLRDIDPGAFQDLNKLEVLILNDNLISTLPANVFQYVPITHLDLRGNRL 
KTLPYEEVLEQIPGIAEILLEDNPWDCTCDLLSLKEWLENIPKNALIGRVVCEAPTRLQGKDL 
NETTEQDLCPLKNRVDSSLPAPPAQEETFAPGPLPTPFKTNGQEDHATPGSAPNGGTKIPGNW
QIKIRPTAAIATGSSRNKPLANSLPCPGGCSCDHIPGSGLKMNCNNRNVSSLADLKPKLSNVQ 
ELFLRDNKIHSIRKSHFVDYKNLILLDLGNNNIATVENNTFKNLLDLRWLYMDSNYLDTLSRE 
KFAGLQNLEYLNVEYNAIQLILPGTFNAMPKLRILILNNNLLRSLPVDVFAGVSLSKLSLHNN 
YFMYLPVAGVLDQLTSIIQIDLHGNPWECSCTIVPFKQWAERLGSEVLMSDLKCETPVNFFRK 
DFMLLSNDEICPQLYARISPTLTSHSKNSTGLAETGTHSNSYLDTSRVSISVLVPGLLLVFVT 
SAFTVVGMLVFILRNRKRSKRRDANSSASEINSLQTVCDSSYWHNGPYNADGAHRVYDCGSHS
LSD 

Important features: 

Signal sequence: 

amino acids 1-15 

Transmembrane domain: 

amino acids 618-638 

N-glycosylation site. 

amino acids 18-22, 253-257, 363-367, 416-420, 595-599, 655-659 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 122-126, 646-650 

Casein kinase II phosphorylation site. 

amino acids 30-34, 180-184, 222-226, 256-260, 366-370, 573-577, 
608-612, 657-661, 666-670, 693-697 

N-myristoylation site. 

amino acids 17-23, 67-73, 100-106, 302-308, 328-334, 343-349, 
354-360, 465-471, 493-499, 598-604, 603-609 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 337-348 

FIGURE 355 

AGTCGACTGCGTCCCCTGTACCCGGCGCCAGCTGTGTTCCTGACCCCAGAATAACTCAGGGCTGCACCGGGCCTG 
GCAGCGCTCCGCACACATTTCCTGTCGCGGCCTAAGGGAAACTGTTGGCCGCTGGGCCCGCGGGGGGATTCTTGG 
CAGTTGGGGGGTCCGTCGGGAGCGAGGGCGGAGGGGAAGGGAGGGGGAACCGGGTTGGGGAAGCCAGCTGTAGAG 
GGCGGTGACCGCGCTCCAGACACAGCTCTGCGTCCTCGAGCGGGACAGATCCAAGTTGGGAGCAGCTCTGCGTGC 
GGGGCCTCAGAGAATGAGGCCGGCGTTCGCCCTGTGCCTCCTCTGGCAGGCGCTCTGGCCCGGGCCGGGCGGCGG 
CGAACACCCCACTGCCGACCGTGCTGGCTGCTCGGCCTCGGGGGCCTGCTACAGCCTGCACCACGCTACCATGAA 
GCGGCAGGCGGCCGAGGAGGCCTGCATCCTGCGAGGTGGGGCGCTCAGCACCGTGCGTGCGGGCGCCGAGCTGCG 
CGCTGTGCTCGCGCTCCTGCGGGCAGGCCCAGGGCCCGGAGGGGGCTCCAAAGACCTGCTGTTCTGGGTCGCACT 
GGAGCGCAGGCGTTCCCACTGCACCCTGGAGAACGAGCCTTTGCGGGGTTTCTCCTGGCTGTCCTCCGACCCCGG 
CGGTCTCGAAAGCGACACGCTGCAGTGGGTGGAGGAGCCCCAACGCTCCTGCACCGCGCGGAGATGCGCGGTACT 
CCAGGCCACCGGTGGGGTCGAGCCCGCAGGCTGGAAGGAGATGCGATGCCACCTGCGCGCCAACGGCTACCTGTG 
CAAGTACCAGTTTGAGGTCTTGTGTCCTGCGCCGCGCCCCGGGGCCGCCTCTAACTTGAGCTATCGCGCGCCCTT 
CCAGCTGCACAGCGCCGCTCTGGACTTCAGTCCACCTGGGACCGAGGTGAGTGCGCTCTGCCGGGGACAGCTCCC 
GATCTCAGTTACTTGCATCGCGGACGAAATCGGCGCTCGCTGGGACAAACTCTCGGGCGATGTGTTGTGTCCCTG 
CCCCGGGAGGTACCTCCGTGCTGGCAAATGCGCAGAGCTCCCTAACTGCCTAGACGACTTGGGAGGCTTTGCCTG 
CGAATGTGCTACGGGCTTCGAGCTGGGGAAGGACGGCCGCTCTTGTGTGACCAGTGGGGAAGGACAGCCGACCCT 
TGGGGGGACCGGGGTGCCCACCAGGCGCCCGCCGGCCACTGCAACCAGCCCCGTGCCGCAGAGAACATGGCCAAT 
CAGGGTCGACGAGAAGCTGGGAGAGACACCACTTGTCCCTGAACAAGACAATTCAGTAACATCTATTCCTGAGAT 
TCCTCGATGGGGATCACAGAGCACGATGTCTACCCTTCAAATGTCCCTTCAAGCCGAGTCAAAGGCCACTATCAC 
CCCATCAGGGAGCGTGATTTCCAAGTTTAATTCTACGACTTCCTCTGCCACTCCTCAGGCTTTCGACTCCTCCTC 
TGCCGTGGTCTTCATATTTGTGAGCACAGCAGTAGTAGTGTTGGTGATCTTGACCATGACAGTACTGGGGCTTGT 
CAAGCTCTGCTTTCACGAAAGCCCCTCTTCCCAGCCAAGGAAGGAGTCTATGGGCCCGCCGGGCCTGGAGAGTGA 
TCCTGAGCCCGCTGCTTTGGGCTCCAGTTCTGCACATTGCACAAACAATGGGGTGAAAGTCGGGGACTGTGATCT 
GCGGGACAGAGCAGAGGGTGCCTTGCTGGCGGAGTCCCCTCTTGGCTCTAGTGATGCA5PAGGGAAACAGGGGACA 
TGGGCACTCCTGTGAACAGTTTTTCACTTTTGATGAAACGGGGAACCAAGAGGAACTTACTTGTGTAACTGACAA 
TTTCTGCAGAAATCCCCCTTCCTCTAAATTCCCTTTACTCCACTGAGGAGCTAAATCAGAACTGCACACTCCTTC 
CCTGATGATAGAGGAAGTGGAAGTGCCTTTAGGATGGTGATACTGGGGGACCGGGTAGTGCTGGGGAGAGATATT 
TTCTTATGTTTATTCGGAGAATTTGGAGAAGTGATTGAACTTTTCAAGACATTGGAAACAAATAGAACACAATAT 
AATTTACATTAAAAAATAATTTCTACCAAAATGGAAAGGAAATGTTCTATGTTGTTCAGGCTAGGAGTATATTGG 
TTCGAAATCCCAGGGAAAAAAATAAAAATAAAAAATTAAAGGATTGTTGAT 
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FIGURE 356 

MRPAFALCLLWQALWPGPGGGEHPTADRAGCSASGACYSLHHATMKRQAAEEACILRGGALST 
VRAGAELRAVLALLRAGPGPGGGSKDLLFWVALERRRSHCTLENEPLRGFSWLSSDPGGLESD 
TLQWVEEPQRSCTARRCAVLQATGGVEPAGWKEMRCHLRANGYLCKYQFEVLCPAPRPGAASN 
LSYRAPFQLHSAALDFSPPGTEVSALCRGQLPISVTCIADEIGARWDKLSGDVLCPCPGRYLR 
AGKCAELPNCLDDLGGFACECATGFELGKDGRSCVTSGEGQPTLGGTGVPTRRPPATATSPVP 
QRTWPIRVDEKLGETPLVPEQDNSVTSIPEIPRWGSQSTMSTLQMSLQAESKATITPSGSVIS 
KFNSTTSSATPQAFDSSSAVVFIFVSTAVVVLVILTMTVLGLVKLCFHESPSSQPRKESMGPP 
GLESDPEPAALGSSSAHCTNNGVKVGDCDLRDRAEGALLAESPLGSSDA 

Important features: 

Signal sequence: 

amino acids 1-16

Transmembrane domain: 

amino acids 399-418 

N-glycosylation site. 

amino acids 189-193, 381-385 

Glycosaminoglycan attachment site. 

amino acids 289-293 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 98-102, 434-438 

Casein kinase II phosphorylation site. 

amino acids 275-279, 288-292, 342-346, 445-449 

N-myristoylation site. 

amino acids 30-36, 35-41, 58-64, 59-65, 121-127, 151-157,
185-191, 209-215, 267-273, 350-356, 374-380, 453-459, 463-469,
477-483 

Aspartic acid and asparagine hydroxylation site. 

amino acids 262-274 
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FIGURE 357 

CCCATCTCAAGCTGATCTTGGCACCTCTCATGCTCTGCTCTCTTCAACCAGACCTCTACATTCCATTTTGGAAGA 
AGACTAAAAATGGTGTTTCCAATGTGGACACTGAAGAGACAAATTCTTATCCTTTTTAACATAATCCTAATTTCC 
AAACTCCTTGGGGCTAGATGGTTTCCTAAAACTCTGCCCTGTGATGTCACTCTGGATGTTCCAAAGAACCATGTG 
ATCGTGGACTGCACAGACAAGCATTTGACAGAAATTGCTGGAGGTATTCCCACGAACACCACGAACCTCACCCTC 
ACCATTAACCACATACCAGACATCTCCCCAGCGTCCTTTCACAGACTGGACCATCTGGTAGAGATCGATTTCAGA 
TGCAACTGTGTACCTATTCCACTGGGGTCAAAAAACAACATGTGCATCAAGAGGCTGCAGATTAAACCCAGAAGC 
TTTAGTGGACTCACTTATTTAAAATCCCTTTACCTGGATGGAAACCAGCTACTAGAGATACCGCAGGGCCTCCCG 
CCTAGCTTACAGCTTCTCAGCCTTGAGGCCAACAACATCTTTTCCATCAGAAAAGAGAATCTAACAGAACTGGCC 
AACATAGAAATACTCTACCTGGGCCAAAACTGTTATTATCGAAATCCTTGTTATGTTTCATATTCAATAGAGAAA 
GATGCCTTCCTAAACTTGACAAAGTTAAAAGTGCTCTCCCTGAAAGATAACAATGTCACAGCCGTCCCTACTGTT 
TTGCCATCTACTTTAACAGAACTATATCTCTACAACAACATGATTGCAAAAATCCAAGAAGATGATTTTAATAAC 
CTCAACCAATTACAAATTCTTGACCTAAGTGGAAATTGCCCTCGTTGTTATAATGCCCCATTTCCTTGTGCGCCG 
TGTAAAAATAATTCTCCCCTACAGATCCCTGTAAATGCTTTTGATGCGCTGACAGAATTAAAAGTTTTACGTCTA 
CACAGTAACTCTCTTCAGCATGTGCCCCCAAGATGGTTTAAGAACATCAACAAACTCCAGGAACTGGATCTGTCC 
CAAAACTTCTTGGCCAAAGAAATTGGGGATGCTAAATTTCTGCATTTTCTCCCCAGCCTCATCCAATTGGATCTG 
TCTTTCAATTTTGAACTTCAGGTCTATCGTGCATCTATGAATCTATCACAAGCATTTTCTTCACTGAAAAGCCTG 
AAAATTCTGCGGATCAGAGGATATGTCTTTAAAGAGTTGAAAAGCTTTAACCTCTCGCCATTACATAATCTTCAA
AATCTTGAAGTTCTTGATCTTGGCACTAACTTTATAAAAATTGCTAACCTCAGCATGTTTAAACAATTTAAAAGA 
CTGAAAGTCATAGATCTTTCAGTGAATAAAATATCACCTTCAGGAGATTCAAGTGAAGTTGGCTTCTGCTCAAAT 
GCCAGAACTTCTGTAGAAAGTTATGAACCCCAGGTCCTGGAACAATTACATTATTTCAGATATGATAAGTATGCA 
AGGAGTTGCAGATTCAAAAACAAAGAGGCTTCTTTCATGTCTGTTAATGAAAGCTGCTACAAGTATGGGCAGACC 
TTGGATCTAAGTAAAAATAGTATATTTTTTGTCAAGTCCTCTGATTTTCAGCATCTTTCTTTCCTCAAATGCCTG 
AATCTGTCAGGAAATCTCATTAGCCAAACTCTTAATGGCAGTGAATTCCAACCTTTAGCAGAGCTGAGATATTTG 
GACTTCTCCAACAACCGGCTTGATTTACTCCATTCAACAGCATTTGAAGAGCTTCACAAACTGGAAGTTCTGGAT 
ATAAGCAGTAATAGCCATTATTTTCAATCAGAAGGAATTACTCATATGCTAAACTTTACCAAGAACCTAAAGGTT 
CTGCAGAAACTGATGATGAACGACAATGACATCTCTTCCTCCACCAGCAGGACCATGGAGAGTGAGTCTCTTAGA 
ACTCTGGAATTCAGAGGAAATCACTTAGATGTTTTATGGAGAGAAGGTGATAACAGATACTTACAATTATTCAAG 
AATCTGCTAAAATTAGAGGAATTAGACATCTCTAAAAATTCCCTAAGTTTCTTGCCTTCTGGAGTTTTTGATGGT 
ATGCCTCCAAATCTAAAGAATCTCTCTTTGGCCAAAAATGGGCTCAAATCTTTCAGTTGGAAGAAACTCCAGTGT 
CTAAAGAACCTGGAAACTTTGGACCTCAGCCACAACCAACTGACCACTGTCCCTGAGAGATTATCCAACTGTTCC 
AGAAGCCTCAAGAATCTGATTCTTAAGAATAATCAAATCAGGAGTCTGACGAAGTATTTTCTACAAGATGCCTTC 
CAGTTGCGATATCTGGATCTCAGCTCAAATAAAATCCAGATGATCCAAAAGACCAGCTTCCCAGAAAATGTCCTC 
AACAATCTGAAGATGTTGCTTTTGCATCATAATCGGTTTCTGTGCACCTGTGATGCTGTGTGGTTTGTCTGGTGG 
GTTAACCATACGGAGGTGACTATTCCTTACCTGGCCACAGATGTGACTTGTGTGGGGCCAGGAGCACACAAGGGC 
CAAAGTGTGATCTCCCTGGATCTGTACACCTGTGAGTTAGATCTGACTAACCTGATTCTGTTCTCACTTTCCATA 
TCTGTATCTCTCTTTCTCATGGTGATGATGACAGCAAGTCACCTCTATTTCTGGGATGTGTGGTATATTTACCAT 
TTCTGTAAGGCCAAGATAAAGGGGTATCAGCGTCTAATATCACCAGACTGTTGCTATGATGCTTTTATTGTGTAT 
GACACTAAAGACCCAGCTGTGACCGAGTGGGTTTTGGCTGAGCTGGTGGCCAAACTGGAAGACCCAAGAGAGAAA 
CATTTTAATTTATGTCTCGAGGAAAGGGACTGGTTACCAGGGCAGCCAGTTCTGGAAAACCTTTCCCAGAGCATA 
CAGCTTAGCAAAAAGACAGTGTTTGTGATGACAGACAAGTATGCAAAGACTGAAAATTTTAAGATAGCATTTTAC 
TTGTCCCATCAGAGGCTCATGGATGAAAAAGTTGATGTGATTATCTTGATATTTCTTGAGAAGCCCTTTCAGAAG 
TCCAAGTTCCTCCAGCTCCGGAAAAGGCTCTGTGGGAGTTCTGTCCTTGAGTGGCCAACAAACCCGCAAGCTCAC 
CCATACTTCTGGCAGTGTCTAAAGAACGCCCTGGCCACAGACAATCATGTGGCCTATAGTCAGGTGTTCAAGGAA 
ACGGTCTAGCCCTTCTTTGCAAAACACAACTGCCTAGTTTACCAAGGAGAGGCCTGGC 
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FIGURE 358 

MVFPMWTLKRQILILFNIILISKLLGARWFPKTLPCDVTLDVPKNHVIVDCTDKHLTEIPGGI 
PTNTTNLTLTINHIPDISPASFHRLDHLVEIDFRCNCVPIPLGSKNNMCIKRLQIKPRSFSGL 
TYLKSLYLDGNQLLEIPQGLPPSLQLLSLEANNIFSIRKENLTELANIEILYLGQNCYYRNPC 
YVSYSIEKDAFLNLTKLKVLSLKDNNVTAVPTVLPSTLTELYLYNNMIAKIQEDDFNNLNQLQ 
ILDLSGNCPRCYNAPFPCAPCKNNSPLQIPVNAFDALTELKVLRLHSNSLQHVPPRWFKNINK 
LQELDLSQNFLAKEIGDAKFLHFLPSLIQLDLSFNFELQVYRASMNLSQAFSSLKSLKILRIR 
GYVFKELKSFNLSPLHNLQNLEVLDLGTNFIKIANLSMFKQFKRLKVIDLSVNKISPSGDSSE
VGFCSNARTSVESYEPQVLEQLHYFRYDKYARSCRFKNKEASFMSVNESCYKYGQTLDLSKNS 
IFFVKSSDFQHLSFLKCLNLSGNLISQTLNGSEFQPLAELRYLDFSNNRLDLLHSTAFEELHK 
LEVLDISSNSHYFQSEGITHMLNFTKNLKVLQKLMMNDNDISSSTSRTMESESLRTLEFRGNH 
LDVLWREGDNRYLQLFKNLLKLEELDISKNSLSFLPSGVFDGMPPNLKNLSLAKNGLKSFSWK 
KLQCLKNLETLDLSHNQLTTVPERLSNCSRSLKNLILKNNQIRSLTKYFLQDAFQLRYLDLSS 
NKIQMIQKTSFPENVLNNLKMLLLHHNRFLCTCDAVWFVWWVNHTEVTIPYLATDVTCVGPGA 
HKGQSVISLDLYTCELDLTNLILFSLSISVSLFLMVMMTASHLYFWDVWYIYHFCKAKIKGYQ 
RLISPDCCYDAFIVYDTKDPAVTEWVLAELVAKLEDPREKHFNLCLEERDWLPGQPVLENLSQ 
SIQLSKKTVFVMTDKYAKTENFKIAFYLSHQRLMDEKVDVIILIFLEKPFQKSKFLQLRKRLC 
GSSVLEWPTNPQAHPYFWQCLKNALATDNHVAYSQVFKETV 

Important features: 
Signal sequence: 

amino acids 1-26

Transmembrane domain: 

amino acids 840-860 
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FIGURE 359 

GACGGCTGGCCACCATGCACGGCTCCTGCAGTTTCCTGATGCTTCTGCTGCCGCTACTGCTAC
TGCTGGTGGCCACCACAGGCCCCGTTGGAGCCCTCACAGATGAGGAGAAACGTTTGATGGTGG 
AGCTGCACAACCTCTACCGGGCCCAGGTATCCCCGACGGCCTCAGACATGCTGCACATGAGAT 
GGGACGAGGAGCTGGCCGCCTTCGCCAAGGCCTACGCACGGCAGTGCGTGTGGGGCCACAACA 
AGGAGCGCGGGCGCCGCGGCGAGAATCTGTTCGCCATCACAGACGAGGGCATGGACGTGCCGC 
TGGCCATGGAGGAGTGGCACCACGAGCGTGAGCACTACAACCTCAGCGCCGCCACCTGCAGCC 
CAGGCCAGATGTGCGGCCACTACACGCAGGTGGTATGGGCCAAGACAGAGAGGATCGGCTGTG 
GTTCCCACTTCTGTGAGAAGCTCCAGGGTGTTGAGGAGACCAACATCGAATTACTGGTGTGCA 
ACTATGAGCCTCCGGGGAACGTGAAGGGGAAACGGCCCTACCAGGAGGGGACTCCGTGCTCCC 
AATGTCCCTCTGGCTACCACTGCAAGAACTCCCTCTGTGAACCCATCGGAAGCCCGGAAGATG 
CTCAGGATTTGCCTTACCTGGTAACTGAGGCCCCATCCTTCCGGGCGACTGAAGCATCAGACT 
CTAGGAAAATGGGTACTCCTTCTTCCCTAGCAACGGGGATTCCGGCTTTCTTGGTAACAGAGG 
TCTCAGGCTCCCTGGCAACCAAGGCTCTGCCTGCTGTGGAAACCCAGGCCCCAACTTCCTTAG 
CAACGAAAGACCCGCCCTCCATGGCAACAGAGGCTCCACCTTGCGTAACAACTGAGGTCCCTT 
CCATTTTGGCAGCTCACAGCCTGCCCTCCTTGGATGAGGAGCCAGTTACCTTCCCCAAATCGA 
CCCATGTTCCTATCCCAAAATCAGCAGACAAAGTGACAGACAAAACAAAAGTGCCCTCTAGGA 
GCCCAGAGAACTCTCTGGACCCCAAGATGTCCCTGACAGGGGCAAGGGAACTCCTACCCCATG 
CCCAGGAGGAGGCTGAGGCTGAGGCTGAGTTGCCTCCTTCCAGTGAGGTCTTGGCCTCAGTTT 
TTCGAGCCCAGGACAAGCCAGGTGAGCTGCAGGCCACACTGGACCACACGGGGCACACCTCCT 
CCAAGTCCCTGCCCAATTTCCCCAATACCTCTGCCACCGCTAATGCCACGGGTGGGCGTGCCC 
TGGCTCTGCAGTCGTCCTTGCCAGGTGCAGAGGGCCCTGACAAGCCTAGCGTTGTGTCAGGGC 
TGAACTCGGGCCCTGGTCATGTGTGGGGCCCTCTCCTGGGACTACTGCTCCTGCCTCCTCTGG 
TGTTGGCTGGAATCTTCTGAATGGGATACCACTCAAAGGGTGAAGAGGTCAGCTGTCCTCCTG 
TCATCTTCCCCACCCTGTCCCCAGCCCCTAAACAAGATACTTCTTGGTTAAGGCCCTCCGGAA 
GGGAAAGGCTACGGGGCATGTGCCTCATCACACCATCCATCCTGGAGGCACAAGGCCTGGCTG 
GCTGCGAGCTCAGGAGGCCGCCTGAGGACTGCACACCGGGCCCACACCTCTCCTGCCCCTCCC 
TCCTGAGTCCTGGGGGTGGGAGGATTTGAGGGAGCTCACTGCCTACCTGGCCTGGGGCTGTCT 
GCCCACACAGCATGTGCGCTCTCCCTGAGTGCCTGTGTAGCTGGGGATGGGGATTCCTAGGGG 
CAGATGAAGGACAAGCCCCACTGGAGTGGGGTTCTTTGAGTGGGGGAGGCAGGGACGAGGGAA
GGAAAGTAACTCCTGACTCTCCAATAAAAACCTGTCCAACCTGTGAAA 
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FIGURE 360 

MHGSCSFLMLLLPLLLLLVATTGPVGALTDEEKRLMVELHNLYRAQVSPTASDMLHMRWDEEL 
AAFAKAYARQCVWGHNKERGRRGENLFAITDEGMDVPLAMEEWHHEREHYNLSAATCSPGQMC 
GHYTQVVWAKTERIGCGSHFCEKLQGVEETNIELLVCNYEPPGNVKGKRPYQEGTPCSQCPSG 
YHCKNSLCEPIGSPEDAQDLPYLVTEAPSFRATEASDSRKMGTPSSLATGIPAFLVTEVSGSL 
ATKALPAVETQAPTSLATKDPPSMATEAPPCVTTEVPSILAAHSLPSLDEEPVTFPKSTHVPI 
PKSADKVTDKTKVPSRSPENSLDPKMSLTGARELLPHAQEEAEAEAELPPSSEVLASVFPAQD 
KPGELQATLDHTGHTSSKSLPNFPNTSATANATGGRALALQSSLPGAEGPDKPSVVSGLNSGP 
GHVWGPLLGLLLLPPLVLAGIF

Important features: 
Signal sequence: 

amino acids 1-22 

N-glycosylation site. 

amino acids 114-118, 403-407, 409-413 

Glycosaminoglycan attachment site.

amino acids 439-443 

Casein kinase II phosphorylation site.

amino acids 29-33, 50-54, 156-160, 195-199, 202-206, 299-303 
N-myristoylation site. 

amino acids 123-129, 143-149, 152-158, 169-175, 180-186, 231-237, 250-256 

Amidation site.

amino acids 82-86, 172-176 

Peroxidases proximal heme-ligand signature.

amino acids 287-298 

Extracellular proteins SCP/Tpx-1/Ag5/PR-1/Sc7 signature 1.

amino acids 127-138 

Extracellular proteins SCP/Tpx-1/Ag5/PR-1/Sc7 signature 2.

amino acids 160-172 
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FIGURE 361 

GACTAGTTCTCTTGGAGTCTGGGAGGAGGAAAGCGGAGCCGGCAGGGAGCGAACCAGGACTGG 
GGTGACGGCAGGGCAGGGGGCGCCTGGCCGGGGAGAAGCGCGGGGGCTGGAGCACCACCAACT 
GGAGGGTCCGGAGTAGCGAGCGCCCCGAAGGAGGCCATCGGGGAGCCGGGAGGGGGGACTGCG
AGAGGACCCCGGCGTCCGGGCTCCCGGTGCCAGCGCTATGAGGCCACTCCTCGTCCTGCTGCT 
CCTGGGCCTGGCGGCCGGCTCGCCCCCACTGGACGACAACAAGATCCCCAGCCTCTGCCCGGG 
GCACCCCGGCCTTCCAGGCACGCCGGGCCACCATGGCAGCCAGGGCTTGCCGGGCCGCGATGG 
CCGCGACGGCCGCGACGGCGCGCCCGGGGCTCCGGGAGAGAAAGGCGAGGGCGGGAGGCCGGG 
ACTGCCGGGACCTCGAGGGGACCCCGGGCCGCGAGGAGAGGCGGGACCCGCGGGGCCCACCGG 
GCCTGCCGGGGAGTGCTCGGTGCCTCCGCGATCCGCCTTCAGCGCCAAGCGCTCCGAGAGCCG 
GGTGCCTCCGCCGTCTGACGCACCCTTGCCCTTCGACCGCGTGCTGGTGAACGAGCAGGGACA 
TTACGACGCCGTCACCGGCAAGTTCACCTGCCAGGTGCCTGGGGTCTACTACTTCGCCGTCCA 
TGCCACCGTCTACCGGGCCAGCCTGCAGTTTGATCTGGTGAAGAATGGCGAATCCATTGCCTC 
TTTCTTCCAGTTTTTCGGGGGGTGGCCCAAGCCAGCCTCGCTCTCGGGGGGGGCCATGGTGAG 
GCTGGAGCCTGAGGACCAAGTGTGGGTGCAGGTGGGTGTGGGTGACTACATTGGCATCTATGC 
CAGCATCAAGACAGACAGCACCTTCTCCGGATTTCTGGTGTACTCCGACTGGCACAGCTCCCC 
AGTCTTTGCTTAGTGCCCACTGCAAAGTGAGCTCATGCTCTCACTCCTAGAAGGAGGGTGTGA 
GGCTGACAACCAGGTCATCCAGGAGGGCTGGCCCCCCTGGAATATTGTGAATGACTAGGGAGG 
TGGGGTAGAGCACTCTCCGTCCTGCTGCTGGCAAGGAATGGGAACAGTGGCTGTCTGCGATCA 
GGTCTGGCAGCATGGGGCAGTGGCTGGATTTCTGCCCAAGACCAGAGGAGTGTGCTGTGCTGG 
CAAGTGTAAGTCCCCCAGTTGCTCTGGTCCAGGAGCCCACGGTGGGGTGCTCTCTTCCTGGTC 
CTCTGCTTCTCTGGATCCTCCCCACCCCCTCCTGCTCCTGGGGCCGGCCCTTTTCTCAGAGAT 
CACTCAATAAACCTAAGAACCCTCATAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 362 

MRPLLVLLLLGLAAGSPPLDDNKIPSLCPGHPGLPGTPGHHGSQGLPGRDGRDGRDGAPGAPG 
EKGEGGRPGLPGPRGDPGPRGEAGPAGPTGPAGECSVPPRSAFSAKRSESRVPPPSDAPLPFD 
RVLVNEQGHYDAVTGKFTCQVPGVYYFAVHATVYRASLQFDLVKNGESIASFFQFFGGWPKPA 
SLSGGAMVRLEPEDQVWVQVGVGDYIGIYASIKTDSTFSGFLVYSDWHSSPVFA 

Important features: 
Signal sequence.

amino acids 1-15 

N-myristoylation sites. 

amino acids 11-17, 68-74, 216-222 

Cell attachment sequence. 

amino acids 77-80 
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FIGURE 363 

GGAGAGCGGAGCGAAGCTGGATAACAGGGGACCGATGATGTGGCGACCATCAGTTCTGCTGCT
TCTGTTGCTACTGAGGCACGGGGCCCAGGGGAAGCCATCCCCAGACGCAGGCCCTCATGGCCA 
GGGGAGGGTGCACCAGGCGGCCCCCCTGAGCGACGCTCCCCATGATGACGCCCACGGGAACTT 
CCAGTACGACCATGAGGCTTTCCTGGGACGGGAAGTGGCCAAGGAATTCGACCAACTCACCCC 
AGAGGAAAGCCAGGCCCGTCTGGGGCGGATCGTGGACCGCATGGACCGCGCGGGGGACGGCGA 
CGGCTGGGTGTCGCTGGCCGAGCTTCGCGCGTGGATCGCGCACACGCAGCAGCGGCACATACG 
GGACTCGGTGAGCGCGGCCTGGGACACGTACGACACGGACCGCGACGGGCGTGTGGGTTGGGA 
GGAGCTGCGCAACGCCACCTATGGCCACTACGCGCCCGGTGAAGAATTTCATGACGTGGAGGA 
TGCAGAGACCTACAAAAAGATGCTGGCTCGGGACGAGCGGCGTTTCCGGGTGGCCGACCAGGA 
TGGGGACTCGATGGCCACTCGAGAGGAGCTGACAGCCTTCCTGCACCCCGAGGAGTTCCCTCA 
CATGCGGGACATCGTGATTGCTGAAACCCTGGAGGACCTGGACAGAAACAAAGATGGCTATGT 
CCAGGTGGAGGAGTACATCGCGGATCTGTACTCAGCCGAGCCTGGGGAGGAGGAGCCGGCGTG 
GGTGCAGACGGAGAGGCAGCAGTTCCGGGACTTCCGGGATCTGAACAAGGATGGGCACCTGGA 
TGGGAGTGAGGTGGGCCACTGGGTGCTGCCCCCTGCCCAGGACCAGCCCCTGGTGGAAGCCAA 
CCACCTGCTGCACGAGAGCGACACGGACAAGGATGGGCGGCTGAGCAAAGCGGAAATCCTGGG 
TAATTGGAACATGTTTGTGGGCAGTCAGGCCACCAACTATGGCGAGGACCTGACCCGGCACCA 
CGATGAGCTGTGAGCACCGCGCACCTGCCACAGCCTCAGAGGCCCGCACAATGACCGGAGGAG 
GGGCCGCTGTGGTCTGGCCCCCTCCCTGTCCAGGCCCCGCAGGAGGCAGATGCAGTCCCAGGC 
ATCCTCCTGCCCCTGGGCTCTCAGGGACCCCCTGGGTCGGCTTCTGTCCCTGTCACACCCCCA 
ACCCCAGGGAGGGGCTGTCATAGTCCCAGAGGATAAGCAATACCTATTTCTGACTGAGTCTCC 
CAGCCCAGACCCAGGGACCCTTGGCCCCAAGCTCAGCTCTAAGAACCGCCCCAACCCCTCCAG 
CTCCAAATCTGAGCCTCCACCACATAGACTGAAACTCCCCTGGCCCCAGCCCTCTCCTGCCTG 
GCCTGGCCTGGGACACCTCCTCTCTGCCAGGAGGCAATAAAAGCCAGCGCCGGGACCTTGAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 364 

MMWRPSVLLLLLLLRHGAQGKPSPDAGPHGQGRVHQAAPLSDAPHDDAHGNFQYDHEAFLGRE 
VAKEFDQLTPEESQARLGRIVDRMDRAGDGDGWVSLAELRAWIAHTQQRHIRDSVSAAWDTYD 
TDRDGRVGWEELRNATYGHYAPGEEFHDVEDAETYKKMLARDERRFRVADQDGDSMATREELT 
AFLHPEEFPHMRDIVIAETLEDLDRNKDGYVQVEEYIADLYSAEPGEEEPAWVQTERQQFRDF 
RDLNKDGHLDGSEVGHWVLPPAQDQPLVEANHLLHESDTDKDGRLSKAEILGNWNMFVGSQAT 
NYGEDLTRHHDEL

Important features:
Signal sequence: 

amino acids 1-20 

N-glycosylation site. 

amino acids 140-144 

Casein kinase II phosphorylation site. 

amino acids 72-76, 98-102, 127-131, 184-188, 208-212, 289-293, 
291-295, 298-302 

N-myristoylation site. 

amino acids 263-269, 311-317 

Endoplasmic reticulum targeting sequence. 

amino acids 325-330 
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FIGURE 365 

GTCTGTTCCCAGGAGTCCTTCGGCGGCTGTTGTGTCAGTGGCCTGATCGCGATGGGGACAAAG 
GCGCAAGTCGAGAGGAAACTGTTGTGCCTCTTCATATTGGCGATCCTGTTGTGCTCCCTGGCA 
TTGGGCAGTGTTACAGTGCACTCTTCTGAACCTGAAGTCAGAATTCCTGAGAATAATCCTGTG 
AAGTTGTCCTGTGCCTACTCGGGCTTTTCTTCTCCCCGTGTGGAGTGGAAGTTTGACCAAGGA 
GACACCACCAGACTCGTTTGCTATAATAACAAGATCACAGCTTCCTATGAGGACCGGGTGACC 
TTCTTGCCAACTGGTATCACCTTCAAGTCCGTGACACGGGAAGACACTGGGACATACACTTGT 
ATGGTCTCTGAGGAAGGCGGCAACAGCTATGGGGAGGTCAAGGTCAAGCTCATCGTGCTTGTG 
CCTCCATCCAAGCCTACAGTTAACATCCCCTCCTCTGCCACCATTGGGAACCGGGCAGTGCTG 
ACATGCTCAGAACAAGATGGTTCCCCACCTTCTGAATACACCTGGTTCAAAGATGGGATAGTG 
ATGCCTACGAATCCCAAAAGCACCCGTGCCTTCAGCAACTCTTCCTATGTCCTGAATCCCACA
ACAGGAGAGCTGGTCTTTGATCCCCTGTCAGCCTCTGATACTGGAGAATACAGCTGTGAGGCA 
CGGAATGGGTATGGGACACCCATGACTTCAAATGCTGTGCGCATGGAAGCTGTGGAGCGGAAT 
GTGGGGGTCATCGTGGCAGCCGTCCTTGTAACCCTGATTCTCCTGGGAATCTTGGTTTTTGGC 
ATCTGGTTTGCCTATAGCCGAGGCCACTTTGACAGAACAAAGAAAGGGACTTCGAGTAAGAAG 
GTGATTTACAGCCAGCCTAGTGCCCGAAGTGAAGGAGAATTCAAACAGACCTCGTCATTCCTG 
GTGTGAGCCTGGTCGGCTCACCGCCTATCATCTGCATTTGCCTTACTCAGGTGCTACCGGACT 
CTGGCCCCTGATGTCTGTAGTTTCACAGGATGCCTTATTTGTCTTCTACACCCCACAGGGCCC 
CCTACTTCTTCGGATGTGTTTTTAATAATGTCAGCTATGTGCCCCATCCTCCTTCATGCCCTC 
CCTCCCTTTCCTACGACTGCTGAGTGGCCTGGAACTTGTTTAAAGTGTTTATTCCCCATTTCT 
TTGAGGGATCAGGAAGGAATCCTGGGTATGCCATTGACTTCCCTTCTAAGTAGACAGCAAAAA 
TGGCGGGGGTCGCAGGAATCTGCACTCAACTGCCCACCTGGCTGGCAGGGATCTTTGAATAGG 
TATCTTGAGCTTGGTTCTGGGCTCTTTCCTTGTGTACTGACGACCAGGGCCAGCTGTTCTAGA 
GCGGGAATTAGAGGCTAGAGCGGCTGAAATGGTTGTTTGGTGATGACACTGGGGTCCTTCCAT 
CTCTGGGGCCCACTCTCTTCTGTCTTCCCATGGGAAGTGCCACTGGGATCCCTCTGCCCTGTC 
CTCCTGAATACAAGCTGACTGACATTGACTGTGTCTGTGGAAAATGGGAGCTCTTGTTGTGGA 
GAGCATAGTAAATTTTCAGAGAACTTGAAGCCAAAAGGATTTAAAACCGCTGCTCTAAAGAAA 
AGAAAACTGGAGGCTGGGCGCAGTGGCTCACGCCTGTAATCCCAGAGGCTGAGGCAGGCGGAT 
CACCTGAGGTCGGGAGTTCGGGATCAGCCTGACCAACATGGAGAAACCCTACTGGAAATACAA 
AGTTAGCCAGGCATGGTGGTGCATGCCTGTAGTCCCAGCTGCTCAGGAGCCTGGCAACAAGAG 
CAAAACTCCAGCTCAAAAAAAAAAAAAAAA 
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MGTKAQVERKLLCLFILAILLCSLALGSVTVHSSEPEVRIPENNPVKLSCAYSGFSSPRVEWK 
FDQGDTTRLVCYNNKITASYEDRVTFLPTGITFKSVTREDTGTYTCMVSEEGGNSYGEVKVKL 
IVLVPPSKPTVNIPSSATIGNRAVLTCSEQDGSPPSEYTWFKDGIVMPTNPKSTRAFSNSSYV 
LNPTTGELVFDPLSASDTGEYSCEARNGYGTPMTSNAVRMEAVERNVGVIVAAVLVTLILLGI 
LVFGIWFAYSRGHFDRTKKGTSSKKVIYSQPSARSEGEFKQTSSFLV 

Important features: 
Signal sequence: 

amino acids 1-27 

Transmembrane domain: 

amino acids 238-255 

N-glycosylation site. 

amino acids 185-189 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 270-274 

Casein kinase II phosphorylation site. 

amino acids 34-38, 82-86, 100-104, 118-122, 152-156, 154-158, 
193-197, 203-207, 287-291 

N-myristoylation site.

amino acids 105-111, 116-122, 158-164, 219-225, 237-243, 256-262 
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FIGURE 367 

GGGGAGAGGAATTGACCATGTAAAAGGAGACTTTTTTTTTTGGTGGTGGTGGCTGTTGGGTGCCTTGCAAAAATG 
AAGGATGCAGGACGCAGCTTTCTCCTGGAACCGAACGCAATGGATAAACTGATTGTGCAAGAGAGAAGGAAGAAC 
GAAGCTTTTTCTTGTGAGCCCTGGATCTTAACACAAATGTGTATATGTGCACACAGGGAGCATTCAAGAATGAAA 
TAAACCAGAGTTAGACCCGCGGGGGTTGGTGTGTTCTGACATAAATAAATAATCTTAAAGCAGCTGTTCCCCTCC 
CCACCCCCAAAAAAAAGGATGATTGGAAATGAAGAACCGAGGATTCACAAAGAAAAAAGTATGTTCATTTTTCTC 
TATi\AAGGAGAAAGTGAGCCAAGGAGATATTTTTGGAATGAAAAGTTTGGGGCTTTTTTAGTAAAGTAAAGAACT 
GGTGTGGTGGTGTTTTCCTTTCTTTTTGAATTTCCCACAAGAGGAGAGGAAATTAATAATACATCTGCAAAGAAA 
TTTCAGAGAAGAAAAGTTGACCGCGGCAGATTGAGGCATTGATTGGGGGAGAGAAACCAGCAGAGCACAGTTGGA 
TTTGTGCCTATGTTGACTAAAATTGACGGATAATTGCAGTTGGATTTTTCTTCATCAACCTCCTTTTTTTTAAAT 
TTTTATTCCTTTTGGTATCAAGATCATGCGTTTTCTCTTGTTCTTAACCACCTGGATTTCCATCTGGATGTTGCT 
GTGATCAGTCTGAAATACAACTGTTTGAATTCCAGAAGGACCAACACCAGATAAATTATGAATGTTGAACAAGAT 
GACCTTACATCCACAGCAGATAATGATAGGTCCTAGGTTTAACAGGGCCCTATTTGACCCCCTGCTTGTGGTGCT 
GCTGGCTCTTCAACTTCTTGTGGTGGCTGGTCTGGTGCGGGCTCAGACCTGCCCTTCTGTGTGCTCCTGCAGCAA 
CCAGTTCAGCAAGGTGATTTGTGTTCGGAAAAACCTGCGTGAGGTTCCGGATGGCATCTCCACCAACACACGGCT 
GCTGAACCTCCATGAGAACCAAATCCAGATCATCAAAGTGAACAGCTTCAAGCACTTGAGGCACTTGGAAATCCT 
ACAGTTGAGTAGGAACCATATCAGAACCATTGAAATTGGGGCTTTCAATGGTCTGGCGAACCTCAACACTCTGGA 
ACTCTTTGACAATCGTCTTACTACCATCCCGAATGGAGCTTTTGTATACTTGTCTAAACTGAAGGAGCTCTGGTT 
GCGAAACAACCCCATTGAAAGCATCCCTTCTTATGCTTTTAACAGAATTCCTTCTTTGCGCCGACTAGACTTAGG 
GGAATTGAAAAGACTTTCATACATCTCAGAAGGTGCCTTTGAAGGTCTGTCCAACTTGAGGTATTTGAACCTTGC 
CATGTGCAACCTTCGGGAAATCCCTAACCTCACACCGCTCATAAAACTAGATGAGCTGGATCTTTCTGGGAATCA 
TTTATCTGCCATCAGGCCTGGCTCTTTCCAGGGTTTGATGCACCTTCAAAAACTGTGGATGATACAGTCCCAGAT 
TCAAGTGATTGAACGGAATGCCTTTGACAACCTTCAGTCACTAGTGGAGATCAACCTGGCACACAATAATCTAAC 
ATTACTGCCTCATGACCTCTTCACTCCCTTGCATCATCTAGAGCGGATACATTTACATCACAACCCTTGGAACTG 
TAACTGTGACATACTGTGGCTCAGCTGGTGGATAAAAGACATGGCCCCCTCGAACACAGCTTGTTGTGCCCGGTG 
TAACACTCCTCCCAATCTAAAGGGGAGGTACATTGGAGAGCTCGACCAGAATTACTTCACATGCTATGCTCCGGT 
GATTGTGGAGCCCCCTGCAGACCTCAATGTCACTGAAGGCATGGCAGCTGAGCTGAAATGTCGGGCCTCCACATC 
CCTGACATCTGTATCTTGGATTACTCCAAATGGAACAGTCATGACACATGGGGCGTACAAAGTGCGGATAGCTGT 
GCTCAGTGATGGTACGTTAAATTTCACAAATGTAACTGTGCAAGATACAGGCATGTACACATGTATGGTGAGTAA 
TTCCGTTGGGAATACTACTGCTTCAGCCACCCTGAATGTTACTGCAGCAACCACTACTCCTTTCTCTTACTTTTC 
AACCGTCACAGTAGAGACTATGGAACCGTCTCAGGATGAGGCACGGACCACAGATAACAATGTGGGTCCCACTCC 
AGTGGTCGACTGGGAGACCACCAATGTGACCACCTCTCTCACACCACAGAGCACAAGGTCGACAGAGAAAACCTT 
CACCATCCCAGTGACTGATATAAACAGTGGGATCCCAGGAATTGATGAGGTCATGAAGACTACCAAAATCATCAT 
TGGGTGTTTTGTGGCCATCACACTCATGGCTGCAGTGATGCTGGTCATTTTCTACAAGATGAGGAAGCAGCACCA 
TCGGCAAAACCATCACGCCCCAACAAGGACTGTTGAAATTATTAATGTGGATGATGAGATTACGGGAGACACACC 
CATGGAAAGCCACCTGCCCATGCCTGCTATCGAGCATGAGCACCTAAATCACTATAACTCATACAAATCTCCCTT 
CAACCACACAACAACAGTTAACACAATAAATTCAATACACAGTTCAGTGCATGAACCGTTATTGATCCGAATGAA 
CTCTAAAGACAATGTACAAGAGACTCAAATCTAAAACATTTACAGAGTTACAAAAAACAAACAATCAAAAAAAAA 
GACAGTTTATTAAAAATGACACAAATGACTGGGCTAAATCTACTGTTTCAAAAAAGTGTCTTTACAAAAAAACAA 
AAAAGAAAAGAAATTTATTTATTAAAAATTCTATTGTGATCTAAAGCAGACAAAAA
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FIGURE 368 

MLNKMTLHPQQIMIGPRFNRALFDPLLVVLLALQLLVVAGLVRAQTCPSVCSCSNQFSKVICVRKNLREVPDGIS
TNTRLLNLHENQIQIIKVNSFKHLRHLEILQLSRNHIRTIEIGAFNGLANLNTLELFDNRLTTIPNGAFVYLSKL 
KELWLRNNPIESIPSYAFNRIPSLRRLDLGELKRLSYISEGAFEGLSNLRYLNLAMCNLREIPNLTPLIKLDELD
LSGNHLSAIRPGSFQGLMHLQKLWMIQSQIQVIERNAFDNLQSLVEINLAHNNLTLLPHDLFTPLHHLERIHLHH 
NPWNCNCDILWLSWWIKDMAPSNTACCARCNTPPNLKGRYIGELDQNYFTCYAPVIVEPPADLNVTEGMAAELKC 
RASTSLTSVSWITPNGTVMTHGAYKVRIAVLSDGTLNFTNVTVQDTGMYTCMVSNSVGNTTASATLNVTAATTTP 
FSYFSTVTVETMEPSQDEARTTDNNVGPTPVVDWETTNVTTSLTPQSTRSTEKTFTIPVTDINSGIPGIDEVMKT 
TKIIIGCFVAITLMAAVMLVIFYKMRKQHHRQNHHAPTRTVEIINVDDEITGDTPMESHLPMPAIEHEHLNHYNS 
YKSPFNHTTTVNTINSIHSSVHEPLLIRMNSKDNVQETQI 

Important features: 
Signal sequence: 

amino acids 1-44

Transmembrane domain: 

amino acids 523-543 

N-glycosylation site. 

amino acids 278-282, 364-368, 390-394, 412-416, 415-419, 434-438, 442-446, 
488-492, 606-610 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 183-187 

Casein kinase II phosphorylation site.

amino acids 268-272, 417-421, 465-469, 579-583, 620-624

N-myristoylation site.

amino acids 40-46, 73-79, 118-124, 191-197, 228-234, 237-243, 391-397,
422-428, 433-439, 531-537 
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FIGURE 369 

CAAAACTTGCGTCGCGGAGAGCGCCCAGCTTGACTTGAATGGAAGGAGCCCGAGCCCGCGGAGCGCAGCTGAGAC 
TGGGGGAGCGCGTTCGGCCTGTGGGGCGCCGCTCGGCGCCGGGGCGCAGCAGGGAAGGGGAAGCTGTGGTCTGCC 
CTGCTCCACGAGGCGCCACTGGTGTGAACCGGGAGAGCCCCTGGGTGGTCCCGTCCCCTATCCCTCCTTTATATA 
GAAACCTTCCACACTGGGAAGGCAGCGGCGAGGCAGGAGGGCTCATGGTGAGCAAGGAGGCCGGCTGATCTGCAG 
GCGCACAGCATTCCGAGTTTACAGATTTTTACAGATACCAAATGGAAGGCGAGGAGGCAGAACAGCCTGCCTGGT 
TCCATCAGCCCTGGCGCCCAGGCGCATCTGACTCGGCACCCCCTGCAGGCACCATGGCCCAGAGCCGGGTGCTGC 
TGCTCCTGCTGCTGCTGCCGCCACAGCTGCACCTGGGACCTGTGCTTGCCGTGAGGGCCCCAGGATTTGGCCGAA 
GTGGCGGCCACAGCCTGAGCCCCGAAGAGAACGAATTTGCGGAGGAGGAGCCGGTGCTGGTACTGAGCCCTGAGG 
AGCCCGGGCCTGGCCCAGCCGCGGTCAGCTGCCCCCGAGACTGTGCCTGTTCCCAGGAGGGCGTCGTGGACTGTG 
GCGGTATTGACCTGCGTGAGTTCCCGGGGGACCTGCCTGAGCACACCAACCACCTATCTCTGCAGAACAACCAGC
TGGAAAAGATCTACCCTGAGGAGCTCTCCCGGCTGCACCGGCTGGAGACACTGAACCTGCAAAACAACCGCCTGA 
CTTCCCGAGGGCTCCCAGAGAAGGCGTTTGAGCATCTGACCAACCTCAATTACCTGTACTTGGCCAATAACAAGC
TGACCTTGGCACCCCGCTTCCTGCCAAACGCCCTGATCAGTGTGGACTTTGCTGCCAACTATCTCACCAAGATCT 
ATGGGCTCACCTTTGGCCAGAAGCCAAACTTGAGGTCTGTGTACCTGCACAACAACAAGCTGGCAGACGCCGGGC 
TGCCGGACAACATGTTCAACGGCTCCAGCAACGTCGAGGTCCTCATCCTGTCCAGCAACTTCCTGCGCCACGTGC 
CCAAGCACCTGCCGCCTGCCCTGTACAAGCTGCACCTCAAGAACAACAAGCTGGAGAAGATCCCCCCGGGGGCCT 
TCAGCGAGCTGAGCAGCCTGCGCGAGCTATACCTGCAGAACAACTACCTGACTGACGAGGGCCTGGACAACGAGA 
CCTTCTGGAAGCTCTCCAGCCTGGAGTACCTGGATCTGTCCAGCAACAACCTGTCTCGGGTCCCAGCTGGGCTGC 
CGCGCAGCCTGGTGCTGCTGCACTTGGAGAAGAACGCCATCCGGAGCGTGGACGCGAATGTGCTGACCCCCATCC 
GCAGCCTGGAGTACCTGCTGCTGCACAGCAACCAGCTGCGGGAGCAGGGCATCCACCCACTGGCCTTCCAGGGCC 
TCAAGCGGTTGCACACGGTGCACCTGTACAACAACGCGCTGGAGCGCGTGCCCAGTGGCCTGCCTCGCCGCGTGC 
GCACCCTCATGATCCTGCACAACCAGATCACAGGCATTGGCCGCGAAGACTTTGCCACCACCTACTTCCTGGAGG 
AGCTCAACCTCAGCTACAACCGCATCACCAGCCCACAGGTGCACCGCGACGCCTTCCGCAAGCTGCGCCTGCTGC 
GCTCGCTGGACCTGTCGGGCAACCGGCTGCACACGCTGCCACCTGGGCTGCCTCGAAATGTCCATGTGCTGAAGG 
TCAAGCGCAATGAGCTGGCTGCCTTGGCACGAGGGGCGCTGGCGGGCATGGCTCAGCTGCGTGAGCTGTACCTCA 
CCAGCAACCGACTGCGCAGCCGAGCCCTGGGCCCCCGTGCCTGGGTGGACCTCGCCCATCTGCAGCTGCTGGACA 
TCGCCGGGAATCAGCTCACAGAGATCCCCGAGGGGCTCCCCGAGTCACTTGAGTACCTGTACCTGCAGAACAACA 
AGATTAGTGCGGTGCCCGCCAATGCCTTCGACTCCACGCCCAACCTCAAGGGGATCTTTCTCAGGTTTAACAAGC 
TGGCTGTGGGCTCCGTGGTGGACAGTGCCTTCCGGAGGCTGAAGCACCTGCAGGTCTTGGACATTGAAGGCAACT 
TAGAGTTTGGTGACATTTCCAAGGACCGTGGCCGCTTGGGGAAGGAAAAGGAGGAGGAGGAAGAGGAGGAGGAGG 
AGGAAGAGGAAACAAGATAGTGACAAGGTGATGCAGATGTGACCTAGGATGATGGACCGCCGGACTCTTTTCTGC
AGCACACGCCTGTGTGCTGTGAGCCCCCCACTCTGCCGTGCTCACACAGACACACCCAGCTGCACACATGAGGCA 
TCCCACATGACACGGGCTGACACAGTCTCATATCCCCACCCCTTCCCACGGCGTGTCCCACGGCCAGACACATGC 
ACACACATCACACCCTCAAACACCCAGCTCAGCCACACACAACTACCCTCCAAACCACCACAGTCTCTGTCACAC 
CCCCACTACCGCTGCCACGCCCTCTGAATCATGCAGGGAAGGGTCTGCCCCTGCCCTGGCACACACAGGCACCCA 
TTCCCTCCCCCTGCTGACATGTGTATGCGTATGCATACACACCACACACACACACATGCACAAGTCATGTGCGAA 
CAGCCCTCCAAAGCCTATGCCACAGACAGCTCTTGCCCCAGCCAGAATCAGCCATAGCAGCTCGCCGTCTGCCCT 
GTCCATCTGTCCGTCCGTTCCCTGGAGAAGACACAAGGGTATCCATGCTCTGTGGCCAGGTGCCTGCCACCCTCT 
GGAACTCACAAAAGCTGGCTTTTATTCCTTTCCCATCCTATGGGGACAGGAGCCTTCAGGACTGCTGGCCTGGCC 
TGGCCCACCCTGCTCCTCCAGGTGCTGGGCAGTCACTCTGCTAAGAGTCCCTCCCTGCCACGCCCTGGCAGGACA 
CAGGCACTTTTCCAATGGGCAAGCCCAGTGGAGGCAGGATGGGAGAGCCCCCTGGGTGCTGCTGGGGCCTTGGGG 
CAGGAGTGAAGCAGAGGTGATGGGGCTGGGCTGAGCCAGGGAGGAAGGACCCAGCTGCACCTAGGAGACACCTTT 
GTTCTTCAGGCCTGTGGGGGAAGTTCCGGGTGCCTTTATTTTTTATTCTTTTCTAAGGAAAAAAATGATAAAAAT 
CTCAAAGCTGATTTTTCTTGTTATAGAAAAACTAATATAAAAGCATTATCCCTATCCCTGCAAAAAAAAAA 
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FIGURE 370 

MEGEEAEQPAWFHQPWRPGASDSAPPAGTMAQSRVLLLLLLLPPQLHLGPVLAVRAPGFGRSG 
GHSLSPEENEFAEEEPVLVLSPEEPGPGPAAVSCPRDCACSQEGVVDCGGIDLREFPGDLPEH 
TNHLSLQNNQLEKIYPEELSRLHRLETLNLQNNRLTSRGLPEKAFEHLTNLNYLYLANNKLTL 
APRFLPNALISVDFAANYLTKIYGLTFGQKPNLRSVYLHNNKLADAGLPDNMFNGSSNVEVLI 
LSSNFLRHVPKHLPPALYKLHLKNNKLEKIPPGAFSELSSLRELYLQNNYLTDEGLDNETFWK 
LSSLEYLDLSSNNLSRVPAGLPRSLVLLHLEKNAIRSVDANVLTPIRSLEYLLLHSNQLREQG 
IHPLAFQGLKRLHTVHLYNNALERVPSGLPRRVRTLMILHNQITGIGREDFATTYFLEELNLS 
YNRITSPQVHRDAFRKLRLLRSLDLSGNRLHTLPPGLPRNVHVLKVKRNELAALARGALAGMA 
QLRELYLTSNRLRSRALGPRAWVDLAHLQLLDIAGNQLTEIPEGLPESLEYLYLQNNKISAVP 
ANAFDSTPNLKGIFLRFNKLAVGSVVDSAFRRLKHLQVLDIEGNLEFGDISKDRGRLGKEKEE 
EEEEEEEEEETR 

Important features: 
Signal sequence:

amino acids 1-48 

N-glycosylation site. 

amino acids 243-247, 310-314, 328-332, 439-443

Casein kinase II phosphorylation site.

amino acids 68-72, 84-88, 246-250, 292-296, 317-321, 591-595

N-myristoylation site.

amino acids 19-25, 107-113, 213-219, 217-223, 236-242, 335-341, 
477-483, 498-502, 539-545, 548-554 

Leucine zipper pattern. 

amino acids 116-138, 251-273, 258-280, 322-344, 464-486, 471-493, 
535-557 
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FIGURE 371 

CACTTTCTCCCTCTCTTCCTTTACTTTCGAGAAACCGCGCTTCCGCTTCTGGTCGCAGAGACCTCGGAGACCGCG 
CCGGGGAGACGGAGGTGCTGTGGGTGGGGGGGACCTGTGGCTGCTCGTACCGCCCCCCACCCTCCTCTTCTGCAC 
TGCCGTCCTCCGGAAGACCTTTTCCCCTGCTCTGTTTCCTTCACCGAGTCTGTGCATCGCCCCGGACCTGGCCGG 
GAGGAGGCTTGGCCGGCGGGAGATGCTCTAGGGGCGGCGCGGGAGGAGCGGCCGGCGGGACGGAGGGCCCGGCAG 
GAAGATGGGCTCCCGTGGACAGGGACTCTTGCTGGCGTACTGCCTGCTCCTTGCCTTTGCCTCTGGCCTGGTCCT 
GAGTCGTGTGCCCCATGTCCAGGGGGAACAGCAGGAGTGGGAGGGGACTGAGGAGCTGCCGTCGCCTCCGGACCA 
TGCCGAGAGGGCTGAAGAACAACATGAAAAATACAGGCCCAGTCAGGACCAGGGGCTCCCTGCTTCCCGGTGCTT 
GCGCTGCTGTGACCCCGGTACCTCCATGTACCCGGCGACCGCCGTGCCCCAGATCAACATCACTATCTTGAAAGG 
GGAGAAGGGTGACCGCGGAGATCGAGGCCTCCAAGGGAAATATGGCAAAACAGGCTCAGCAGGGGCCAGGGGCCA 
CACTGGACCCAAAGGGCAGAAGGGCTCCATGGGGGCCCCTGGGGAGCGGTGCAAGAGCCACTACGCCGCCTTTTC 
GGTGGGCCGGAAGAAGCCCATGCACAGCAACCACTACTACCAGACGGTGATCTTCGACACGGAGTTCGTGAACCT 
CTACGACCACTTCAACATGTTCACCGGCAAGTTCTACTGCTACGTGCCCGGCCTCTACTTCTTCAGCCTCAACGT 
GCACACCTGGAACCAGAAGGAGACCTACCTGCACATCATGAAGAACGAGGAGGAGGTGGTGATCTTGTTCGCGCA 
GGTGGGCGACCGCAGCATCATGCAAAGCCAGAGCCTGATGCTGGAGCTGCGAGAGCAGGACCAGGTGTGGGTACG 
CCTCTACAAGGGCGAACGTGAGAACGCCATCTTCAGCGAGGAGCTGGACACCTACATCACCTTCAGTGGCTACCT
GGTCAAGCACGCCACCGAGCCCTAGCTGGCCGGCCACCTCCTTTCCTCTCGCCACCTTCCACCCCTGCGCTGTGC 
TGACCCCACCGCCTCTTCCCCGATCCCTGGACTCCGACTCCCTGGCTTTGGCATTCAGTGAGACGCCCTGCACAC 
ACAGAAAGCCAAAGCGATCGGTGCTCCCAGATCCCGCAGCCTCTGGAGAGAGCTGACGGCAGATGAAATCACCAG 
GGCGGGGCACCCGCGAGAACCCTCTGGGACCTTCCGCGGCCCTCTCTGCACACATCCTCAAGTGACCCCGCACGG 
CGAGACGCGGGTGGCGGCAGGGCGTCCCAGGGTGCGGCACCGCGGCTCCAGTCCTTGGAAATAATTAGGCAAATT 
CTAAAGGTCTCAAAAGGAGCAAAGTAAACCGTGGAGGACAAAGAAAAGGGTTGTTATTTTTGTCTTTCCAGCCAG 
CCTGCTGGCTCCCAAGAGAGAGGCCTTTTCAGTTGAGACTCTGCTTAAGAGAAGATCCAAAGTTAAAGCTCTGGG 
GTCAGGGGAGGGGCCGGGGGCAGGAAACTACCTCTGGCTTAATTCTTTTAAGCCACGTAGGAACTTTCTTGAGGG 
ATAGGTGGACCCTGACATCCCTGTGGCCTTGCCCAAGGGCTCTGCTGGTCTTTCTGAGTCACAGCTGCGAGGTGA 
TGGGGGCTGGGGCCCCAGGCGTCAGCCTCCCAGAGGGACAGCTGAGCCCCCTGCCTTGGCTCCAGGTTGGTAGAA 
GCAGCCGAAGGGCTCCTGACAGTGGCCAGGGACCCCTGGGTCCCCCAGGCCTGCAGATGTTTCTATGAGGGGCAG 
AGCTCCTTGGTACATCCATGTGTGGCTCTGCTCCACCCCTGTGCCACCCCAGAGCCCTGGGGGGTGGTCTCCATG 
CCTGCCACCCTGGCATCGGCTTTCTGTGCCGCCTCCCACACAAATCAGCCCCAGAAGGCCCCGGGGCCTTGGCTT 
CTGTTTTTTATAAT^ACACCTCAAGCAGCACTGCAGTCTCCCATCTCCTCGTGGGCTAAGCATCACCGCTTCCACG 
TGTGTTGTGTTGGTTGGCAGCAAGGCTGATCCAGACCCCTTCTGCCCCCACTGCCCTCATCCAGGCCTCTGACCA 
GTAGCCTGAGAGGGGCTTTTTCTAGGCTTCAGAGCAGGGGAGAGCTGGAAGGGGCTAGAAAGCTCCCGCTTGTCT 
GTTTCTCAGGCTCCTGTGAGCCTCAGTCCTGAGACCAGAGTCAAGAGGAAGTACACGTCCCAATCACCCGTGTCA 
GGATTCACTCTCAGGAGCTGGGTGGCAGGAGAGGCAATAGCCCCTGTGGCAATTGCAGGACCAGCTGGAGCAGGG 
TTGCGGTGTCTCCACGGTGCTCTCGCCCTGCCCATGGCCACCCCAGACTCTGATCTCCAGGAACCCCATAGCCCC 
TCTCCACCTCACCCCATGTTGATGCCCAGGGTCACTCTTGCTACCCGCTGGGCCCCCAAACCCCCGCTGCCTCTC 
TTCCTTCCCCCCATCCCCCACCTGGTTTTGACTAATCCTGCTTCCCTCTCTGGGCCTGGCTGCCGGGATCTGGGG 
TCCCTAAGTCCCTCTCTTTAAAGAACTTCTGCGGGTCAGACTCTGAAGCCGAGTTGCTGTGGGCGTGCCCGGAAG 
CAGAGCGCCACACTCGCTGCTTAAGCTCCCCCAGCTCTTTCCAGAAAACATTAAACTCAGAATTGTGTTTTCAA 
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FIGURE 372 

MGSRGQGLLLAYCLLLAFASGLVLSRVPHVQGEQQEWEGTEELPSPPDHAERAEEQHEKYRPS 
QDQGLPASRCLRCCDPGTSMYPATAVPQINITILKGEKGDRGDRGLQGKYGKTGSAGARGHTG 
PKGQKGSMGAPGERCKSHYAAFSVGRKKPMHSNHYYQTVIFDTEFVNLYDHFNMFTGKFYCYV 
PGLYFFSLNVHTWNQKETYLHIMKNEEEVVILFAQVGDRSIMQSQSLMLELREQDQVWVRLYK
GERENAIFSEELDTYITFSGYLVKHATEP 

Important features: 
Signal sequence. 

amino acids 1-25 

N-glycosylation site. 

amino acids 93-97 

N-myristoylation sites.

amino acids 7-13, 21-27, 67-73, 117-123, 129-135 

Amidation site.

amino acids 150-154 

Cell attachment sequence. 

amino acids 104-107 
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FIGURE 373 

CGGAGTGGTGCGCCAACGTGAGAGGAAACCCGTGCGCGGCTGCGCTTTCCTGTCCCCAAGCCG 
TTCTAGACGCGGGT^AAAATGCTTTCTGAAAGCAGCTCCTTTTTGAAGGGTGTGATGCTTGGAA 
GCATTTTCTGTGCTTTGATCACTATGCTAGGACACATTAGGATTGGTCATGGAAATAGAATGC 
ACCACCATGAGCATCATCACCTACAAGCTCCTAACAAAGAAGATATCTTGAAAATTTCAGAGG 
ATGAGCGCATGGAGCTCAGTAAGAGCTTTCGAGTATACTGTATTATCCTTGTAAAACCCAAAG 
ATGTGAGTCTTTGGGCTGCAGTAAAGGAGACTTGGACCAAACACTGTGACAAAGCAGAGTTCT 
TCAGTTCTGAAAATGTTAAAGTGTTTGAGTCAATTAATATGGACACAAATGACATGTGGTTAA 
TGATGAGAAAAGCTTACAAATACGCCTTTGATAAGTATAGAGACCAATACAACTGGTTCTTCC 
TTGCACGCCCCACTACGTTTGCTATCATTGAAAACCTAAAGTATTTTTTGTTAAAAAAGGATC 
CATCACAGCCTTTCTATCTAGGCCACACTATAAAATCTGGAGACCTTGAATATGTGGGTATGG 
AAGGAGGAATTGTCTTAAGTGTAGAATCAATGAAAAGACTTAACAGCCTTCTCAATATCCCAG 
AAAAGTGTCCTGAACAGGGAGGGATGATTTGGAAGATATCTGAAGATAAACAGCTAGCAGTTT
GCCTGAAATATGCTGGAGTATTTGCAGAAAATGCAGAAGATGCTGATGGAAAAGATGTATTTA
ATACCAAATCTGTTGGGCTTTCTATTAAAGAGGCAATGACTTATCACCCCAACCAGGTAGTAG
AAGGCTGTTGTTCAGATATGGCTGTTACTTTTAATGGACTGACTCCAAATCAGATGCATGTGA
TGATGTATGGGGTATACCGCCTTAGGGCATTTGGGCATATTTTCAATGATGCATTGGTTTTCT
TACCTCCAAATGGTTCTGACAATGACTGAGAAGTGGTAGAAAAGCGTGAATATGATCTTTGTA
TAGGACGTGTGTTGTCATTATTTGTAGTAGTAACTACATATCCAATACAGCTGTATGTTTCTT
TTTCTTTTCTAATTTGGTGGCACTGGTATAACCACACATTAAAGTCAGTAGTACATTTTTAAA
TGAGGGTGGTTTTTTTCTTTAAAACACATGAACATTGTAAATGTGTTGGAAAGAAGTGTTTTA
AGAATAATAATTTTGCAAATAAACTATTAATAAATATTATATGTGATAAATTCTAAATTATGA
ACATTAGAAATCTGTGGGGCACATATTTTTGCTGATTGGTTAAAAAATTTTAACAGGTCTTTA
GCGTTCTAAGATATGCAAATGATATCTCTAGTTGTGAATTTGTGATTAAAGTAAAACTTTTAG
CTGTGTGTTCCCTTTACTTCTAATACTGATTTATGTTCTAAGCCTCCCCAAGTTCCAATGGAT
TTGCCTTCTCAAAATGTACAACTAAGCAACTAAAGAAAATTAAAGTGAAAGTTGAAAAAT
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FIGURE 374 

MLSESSSFLKGVMLGSIFCALITMLGHIRIGHGNRMHHHEHHHLQAPNKEDILKISEDERMELSKSFRVYCIILV 
KPKDVSLWAAVKETWTKHCDKAEFFSSENVKVFESINMDTNDMWLMMRKAYKYAFDKYRDQYNWFFLARPTTFAI 
IENLKYFLLKKDPSQPFYLGHTIKSGDLEYVGMEGGIVLSVESMKRLNSLLNIPEKCPEQGGMIWKISEDKQLAV 
CLKYAGVFAENAEDADGKDVFNTKSVGLSIKEAMTYHPNQVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAFG
HIFNDALVFLPPNGSDND 

Important features: 
Signal sequence: 

amino acids 1-33 

N-glycosylation site. 

amino acids 121-125, 342-346 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 319-323, 464-468

Casein kinase II phosphorylation site. 

amino acids 64-132, 150-154, 322-326, 331-335, 368-372, 385-389, 399-403, 
409-413, 473-477, 729-733, 748-752

Tyrosine kinase phosphorylation site. 

amino acids 736-743 

N-myristoylation site. 

amino acids 19-25, 23-29, 136-142, 397-403, 441-447, 544-550, 558-564, 
651-657, 657-663, 672-672 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 14-25 

Cell attachment sequence. 

amino acids 247-250 
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FIGURE 375 

GTTGTGTCCTTCAGCAAAACAGTGGATTTAAATCTCCTTGCACAAGCTTGAGAGCAACACAAT 

CTATCAGGAAAGAAAGAAAGAAAAAAACCGAACCTGACAAAAAAGAAGAAAAAGAAGAAGAAA 

AAAAATCATGAAAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCAATCTTCACGGG

GCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTTCCCCAA 

AGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTATTGACAA 

CCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAATGACAAGTG 

GTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCATCGAGATCCA 

GAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACAACCACCCAAA 

GACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATTTCTTCAGATAT 

CTCCATTAATGAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAGACCAGAGCCTAC 

GGTTACTTGGAGACACATCTCTCCCAAAGCGGTTGGCTTTGTGAGTGAAGACGAATACTTGGA 

AATTCAGGGCATCACCCGGGAGCAGTCAGGGGACTACGAGTGCAGTGCCTCCAATGACGTGGC 

CGCGCCCGTGGTACGGAGAGTAAAGGTCACCGTGAACTATCCACCATACATTTCAGAAGCCAA 

GGGTACAGGTGTCCCCGTGGGACAAAAGGGGACACTGCAGTGTGAAGCCTCAGCAGTCCCCTC 

AGCAGAATTCCAGTGGTACAAGGATGACAAAAGACTGATTGAAGGAAAGAAAGGGGTGAAAGT 

GGAAAACAGACCTTTCCTCTCAAAACTCATCTTCTTCAATGTCTCTGAACATGACTATGGGAA 

CTACACTTGCGTGGCCTCCAACAAGCTGGGCCACACCAATGCCAGCATCATGCTATTTGGTCC 

AGGCGCCGTCAGCGAGGTGAGCAACGGCACGTCGAGGAGGGCAGGCTGCGTGTGGCTGCTGCC 

TCTTCTGGTCTTGCACCTGCTTCTCAAATTT2S&TGTGAGTGCCACTTCCCCACCCGGGAAAG 

GCTGCCGCCACCACCACCACCAACACAACAGCAATGGCAACACCGACAGCAACCAATCAGATA 

TATACAAATGAAATTAGAAGAAACACAGCCTCATGGGACAGAAATTTGAGGGAGGGGAACAAA 

GAATACTTTGGGGGGAAAAGAGTTTTAAAAAAGAAATTGAAAATTGCCTTGCAGATATTTAGG 

TACAATGGAGTTTTCTTTTCCCAAACGGGAAGAACACAGCACACCCGGCTTGGACCCACTGCA 

AGCTGCATCGTGCAACCTCTTTGGTGCCAGTGTGGGCAAGGGCTCAGCCTCTCTGCCCACAGA 

GTGCCCCCACGTGGAACATTCTGGAGCTGGCCATCCCAAATTCAATCAGTCCATAGAGACGAA 

CAGAATGAGACCTTCCGGCCCAAGCGTGGCGCTGCGGGCACTTTGGTAGACTGTGCCACCACG 

GCGTGTGTTGTGAAACGTGAAATAAAAAGAGCAAAAAAAAA 
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FIGURE 376 

MKTIQPKMHNSISWAIFTGLAALCLFQGVPVRSGDATFPKAMDNVTVRQGESATLRCTIDNRV 
TRVAWLNRSTILYAGNDKWCLDPRVVLLSNTQTQYSIEIQNVDVYDEGPYTCSVQTDNHPKTS
RVHLIVQVSPKIVEISSDISINEGNNISLTCIATGRPEPTVTWRHISPKAVGFVSEDEYLEIQ
GITREQSGDYECSASNDVAAPVVRRVKVTVNYPPYISEAKGTGVPVGQKGTLQCEASAVPSAE
FQWYKDDKRLIEGKKGVKVENRPFLSKLIFFNVSEHDYGNYTCVASNKLGHTNASIMLFGPGA
VSEVSNGTSRRAGCVWLLPLLVLHLLLKF 

Important features: 
Signal peptide: 

amino acids 1-28 
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FIGURE 377 

CTTCTTTGAAAAGGATTATCACCTGATCAGGTTCTCTCTGCATTTGCCCCTTTAGATTGTGAA 
ATGTGGCTCAAGGTCTTCACAACTTTCCTTTCCTTTGCAACAGGTGCTTGCTCGGGGCTGAAG 
GTGACAGTGCCATCACACACTGTCCATGGCGTCAGAGGTCAGGCCCTCTACCTACCCGTCCAC 
TATGGCTTCCACACTCCAGCATCAGACATCCAGATCATATGGCTATTTGAGAGACCCCACACA 
ATGCCCAAATACTTACTGGGCTCTGTGAATAAGTCTGTGGTTCCTGACTTGGAATACCAACAC 
AAGTTCACCATGATGCCACCCAATGCATCTCTGCTTATCAACCCACTGCAGTTCCCTGATGAA 
GGCAATTACATCGTGAAGGTCAACATTCAGGGAAATGGAACTCTATCTGCCAGTCAGAAGATA 
CAAGTCACGGTTGATGATCGTGTCACAAAGCCAGTGGTGCAGATTCATCCTCCCTCTGGGGCT 
GTGGAGTATGTGGGGAACATGACCCTGACATGCCATGTGGAAGGGGGCACTCGGCTAGCTTAC 
CAATGGCTAAAAAATGGGAGACCTGTCCACACCAGCTCCACCTACTCCTTTTCTCCCCAAAAC 
AATACCCTTCATATTGCTCCAGTAACCAAGGAAGACATTGGGAATTACAGCTGCCTGGTGAGG 
AACCCTGTCAGTGAAATGGAAAGTGATATCATTATGCCCATCATATATTATGGACCTTATGGA 
CTTCAAGTGAATTCTGATAAAGGGCTAAAAGTAGGGGAAGTGTTTACTGTTGACCTTGGAGAG 
GCCATCCTATTTGATTGTTCTGCTGATTCTCATCCCCCCAACACCTACTCCTGGATTAGGAGG 
ACTGACAATACTACATATATCATTAAGCATGGGCCTCGCTTAGAAGTTGCATCTGAGAAAGTA 
GCGCAGAAGACAATGGACTATGTGTGCTGTGCTTACAACAACATAACCGGCAGGCAAGATGAA 
ACTCATTTCACAGTTATCATCACTTCCGTAGGACTGGAGAAGCTTGCACAGAAAGGAAAATCA 
TTGTCACCTTTAGCAAGTATAACTGGAATATCACTATTTTTGATTATATCCATGTGTCTTCTC 
TTCCTATGGAAAAAATATCAACCCTACAAAGTTATAAAACAGAAACTAGAAGGCAGGCCAGAA 
ACAGAATACAGGAAAGCTCAAACATTTTCAGGCCATGAAGATGCTCTGGATGACTTCGGAATA 
TATGAATTTGTTGCTTTTCCAGATGTTTCTGGTGTTTCCAGGATTCCAAGCAGGTCTGTTCCA 
GCCTCTGATTGTGTATCGGGGCAAGATTTGCACAGTACAGTGTATGAAGTTATTCAGCACATC 
CCTGCCCAGCAGCAAGACCATCCAGAGTGAACTTTCATGGGCTAAACAGTACATTCGAGTGAA 
ATTCTGAAGAAACATTTTAAGGAAAAACAGTGGAAAAGTATATTAATCTGGAATCAGTGAAGA 
AACCAGGACCAACACCTCTTACTCATTATTCCTTTACATGCAGAATAGAGGCATTTATGCAAA 
TTGAACTGCAGGTTTTTCAGCATATACACAATGTCTTGTGCAACAGAAAAACATGTTGGGGAA 
ATATTCCTCAGTGGAGAGTCGTTCTCATGCTGACGGGGAGAACGAAAGTGACAGGGGTTTCCT 
CATAAGTTTTGTATGAAATATCTCTACAAACCTCAATTAGTTCTACTCTACACTTTCACTATC 
ATCAACACTGAGACTATCCTGTCTCACCTACAAATGTGGAAACTTTACATTGTTCGATTTTTC 
AGCAGACTTTGTTTTATTAAATTTTTATTAGTGTTAAGAATGCTAAATTTATGTTTCAATTTT 
ATTTCCAAATTTCTATCTTGTTATTTGTACAACAAAGTAATAAGGATGGTTGTCACAAAAACA 
AAACTATGCCTTCTCTTTTTTTTCAATCACCAGTAGTATTTTTGAGAAGACTTGTGAACACTT 
AAGGAAATGACTATTAAAGTCTTATTTTTATTTTTTTCAAGGAAAGATGGATTCAAATAAATT 
ATTCTGTTTTTGCTTTTAAAAAAAAAAAAAA 
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FIGURE 378 

MWLKVFTTFLSFATGACSGLKVTVPSHTVHGVRGQALYLPVHYGFHTPASDIQIIWLFERPHTMPKYLLGSVNKS 
VVPDLEYQHKFTMMPPNASLLINPLQFPDEGNYIVKVNIQGNGTLSASQKIQVTVDDPVTKPVVQIHPPSGAVEY 
VGNMTLTCHVEGGTRLAYQWLKNGRPVHTSSTYSFSPQNNTLHIAPVTKEDIGNYSCLVRNPVSEMESDIIMPII 
YYGPYGLQVNSDKGLKVGEVFTVDLGEAILFDCSADSHPPNTYSWIRRTDNTTYIIKHGPRLEVASEKVAQKTMD
YVCCAYNNITGRQDETHFTVIITSVGLEKLAQKGKSLSPLASITGISLFLIISMCLLFLWKKYQPYKVIKQKLEG 
RPETEYRKAQTFSGHEDALDDFGIYEFVAFPDVSGVSRIPSRSVPASDCVSGQDLHSTVYEVIQHIPAQQQDHPE 

Important features: 
Signal sequence: 

amino acids 1-18 

Transmembrane domain: 

amino acids 341-359 

N-glycosylation site. 

amino acids 73-77, 92-96, 117-121, 153-157, 189-193, 204-208, 276-280, 308-312

Casein kinase II phosphorylation site.

amino acids 129-133, 198-202, 214-218, 388-392, 426-430, 433-437 

Tyrosine kinase phosphorylation site. 

amino acids 272-280 

N-myristoylation site. 

amino acids 15-21, 19-25, 118-124, 163-167, 203-209, 231-237, 239-245 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 7-18 
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FIGURE 379 

ATAGTAGAAGAATGTCTCTGAAATTACTGGATGAGTTTCAGTCATACTTTCACATGGGCACAA 
TTTCACATTCAAGCTCCTTATCCTAGGCTAATTTTATATTATGTTAAATCACTTGTTTTTGTT 
CTCACGGCTTCCTGCCTGCTATAGGCATAATTACGAGGAAGCAGAACTTCTCCAGAAGCAAGC 
GCACATGCGTTCCAAAATAAGAGCAAATTCGCTCTAAACACAGGAAAAGACCTGAAGCTTTAA 
TTAAGGGGTTACATCCAACCCCAGAGCGCTTTTGTGGGCACTGATTGCTCCAGCTTCTGCGTC 
ACTGCGCGAGGGAAGAGGGAAGAGGATCCAGGCGTTAGACATGTATAGACACAAAAACAGCTG 
GAGATTGGGCTTAAAATACCCACCAAGCTCCAAAGAAGAGACCCAAGTCCCCAAAACATTGAT 
TTCAGGGCTGCCAGGAAGGAAGAGCAGCAGCAGGGTGGGAGAGAAGCTCCAGTCAGCCCACAA 
GATGCCATTGTCCCCCGGCCTCCTGCTGCTGCTGCTCTCCGGGGCCACGGCCACCGCTGCCCT 
GCCCCTGGAGGGTGGCCCCACCGGCCGAGACAGCGAGCATATGCAGGAAGCGGCAGGAATAAG 
GAAAAGCAGCCTCCTGACTTTCCTCGCTTGGTGGTTTGAGTGGACCTCCCAGGCCAGTGCCGG 
GCCCCTCATAGGAGAGGAAGCTCGGGAGGTGGCCAGGCGGCAGGAAGGCGCACCCCCCCAGCA 
ATCCGCGCGCCGGGACAGAATGCCCTGCAGGAACTTCTTCTGGAAGACCTTCTCCTCCTGCAA 

ATAG 
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FIGURE 380 

MYRHKNSWRLGLKYPPSSKEETQVPKTLISGLPGRKSSSRVGEKLQSAHKMPLSPGLLLLLLS 
GATATAALPLEGGPTGRDSEHMQEAAGIRKSSLLTFLAWWFEWTSQASAGPLIGEEAREVARR 
QEGAPPQQSARRDRMPCRNFFWKTFSSCK 

Important features: 
Transmembrane domain: 

amino acids 51-69 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 35-39, 92-96 

N-myristoylation sites.

amino acids 64-70, 75-81, 90-96

Amidation site.

amino acids 33-37 
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FIGURE 381 

GGCGCCGGTGCACCGGGCGGGCTGAGCGCCTCCTGCGGCCCGGCCTGCGCGCCCCGGCCCGCC 

GCGCCGCCCACGCCCCAACCCCGGCCCGCGCCCCCTAGCCCCCGCCCGGGCCCGCGCCCGCGC 

CCGCGCCCAGGTGAGCGCTCCGCCCGCCGCGAGGCCCCGCCCCGGCCCGCCCCCGCCCCGCCC

CGGCCGGCGGGGGAACCGGGCGGATTCCTCGCGCGTCAAACCACCTGATCCCATAAAACATTC 

ATCCTCPCGGCGGCCCGCGCTGCGAGCGCCCCGCCAGTCCGCGCCGCCGCCGCCCTCGCCCTG 

TGCGCCCTGCGCGCCCTGCGCACCCGCGGCCCGAGCCCAGCCAGAGCCGGGCGGAGCGGAGCG 

CGCCGAGCCTCGTCCCGCGGCCGGGCCGGGGCCGGGCCGTAGCGGCGGCGCCTGGATGCGGAC 

CCGGCCGCGGGGAGACGGGCGCCCGCCCCGAAACGACTTTCAGTCCCCGACGCGCCCCGCCCA 

ACCCCTACGATGAAGAGGGCGTCCGCTGGAGGGAGCCGGCTGCTGGCATGGGTGCTGTGGCTG 

CAGGCCTGGCAGGTGGCAGCCCCATGCCCAGGTGCCTGCGTATGCTACAATGAGCCCAAGGTG 

ACGACAAGCTGCCCCCAGCAGGGCCTGCAGGCTGTGCCCGTGGGCATCCCTGCTGCCAGCCAG 

CGCATCTTCCTGCACGGCAACCGCATCTCGCATGTGCCAGCTGCCAGCTTCCGTGCCTGCCGC 

AACCTCACCATCCTGTGGCTGCACTCGAATGTGCTGGCCCGAATTGATGCGGCTGCCTTCACT 

GGCCTGGCCCTCCTGGAGCAGCTGGACCTCAGCGATAATGCACAGCTCCGGTCTGTGGACCCT 

GCCACATTCCACGGCCTGGGCCGCCTACACACGCTGCACCTGGACCGCTGCGGCCTGCAGGAG 

CTGGGCCCGGGGCTGTTCCGCGGCCTGGCTGCCCTGCAGTACCTCTACCTGCAGGACAACGCG 

CTGCAGGCACTGCCTGATGACACCTTCCGCGACCTGGGCAACCTCACACACCTCTTCCTGCAC 

GGCAACCGCATCTCCAGCGTGCCCGAGCGCGCCTTCCGTGGGCTGCACAGCCTCGACCGTCTC

CTACTGCACCAGAACCGCGTGGCCCATGTGCACCCGCATGCCTTCCGTGACCTTGGCCGCCTC 

ATGACACTCTATCTGTTTGCCAACAATCTATCAGCGCTGCCCACTGAGGCCCTGGCCCCCCTG 

CGTGCCCTGCAGTACCTGAGGCTCAACGACAACCCCTGGGTGTGTGACTGCCGGGCACGCCCA 

CTCTGGGCCTGGCTGCAGAAGTTCCGCGGCTCCTCCTCCGAGGTGCCCTGCAGCCTCCCGCAA 

CGCCTGGCTGGCCGTGACCTCAAACGCCTAGCTGCCAATGACCTGCAGGGCTGCGCTGTGGCC 

ACCGGCCCTTACCATCCCATCTGGACCGGCAGGGCCACCGATGAGGAGCCGCTGGGGCTTCCC 

AAGTGCTGCCAGCCAGATGCCGCTGACAAGGCCTCAGTACTGGAGCCTGGAAGACCAGCTTCG 

GCAGGCAATGCGCTGAAGGGACGCGTGCCGCCCGGTGACAGCCCGCCGGGCAACGGCTCTGGC

CCACGGCACATCAATGACTCACCCTTTGGGACTCTGCCTGGCTCTGCTGAGCCCCCGCTCACT

GCAGTGCGGCCCGAGGGCTCCGAGCCACCAGGGTTCCCCACCTCGGGCCCTCGCCGGAGGCCA 

GGCTGTTCACGCAAGAACCGCACCCGCAGCCACTGCCGTCTGGGCCAGGCAGGCAGCGGGGGT 

GGCGGGACTGGTGACTCAGAAGGCTCAGGTGCCCTACCCAGCCTCACCTGCAGCCTCACCCCC 

CTGGGCCTGGCGCTGGTGCTGTGGACAGTGCTTGGGCCCTGCTGACCCCCAGCGGACACAAGA

GCGTGCTCAGCAGCCAGGTGTGTGTACATACGGGGTCTCTCTCCACGCCGCCAAGCCAGCCGG 

GCGGCCGACCCGTGGGGCAGGCCAGGCCAGGTCCTCCCTGATGGACGCCTGCCGCCCGCCACC 

CCCATCTCCACCCCATCATGTTTACAGGGTTCGGCGGCAGCGTTTGTTCCAGAACGCCGCCTC 

CCACCCAGATCGCGGTATATAGAGATATGCATTTTATTTTACTTGTGTAAAAATATCGGACGA 

CGTGGAATAAAGAGCTCTTTTCTTAAAAAAA 
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FIGURE 382 

MKRASAGGSRLLAWVLWLQAWQVAAPCPGACVCYNEPKVTTSCPQQGLQAVPVGIPAASQRIF 
LHGNRISHVPAASFRACRNLTILWLHSNVLARIDAAAFTGLALLEQLDLSDNAQLRSVDPATF 
HGLGRLHTLHLDRCGLQELGPGLFRGLAALQYLYLQDNALQALPDDTFRDLGNLTHLFLHGNR 
ISSVPERAFRGLHSLDRLLLHQNRVAHVHPHAFRDLGRLMTLYLFANNLSALPTEALAPLRAL 
QYLRLNDNPWVCDCRARPLWAWLQKFRGSSSEVPCSLPQRLAGRDLKRLAANDLQGCAVATGP 
YHPIWTGRATDEEPLGLPKCCQPDAADKASVLEPGRPASAGNALKGRVPPGDSPPGNGSGPRH 
INDSPFGTLPGSAEPPLTAVRPEGSEPPGFPTSGPRRRPGCSRKNRTRSHCRLGQAGSGGGGT 
GDSEGSGALPSLTCSLTPLGLALVLWTVLGPC

Important features: 
Signal peptide: 

amino acids 1-26 

Leucine zipper pattern. 

amino acids 135-156 

Glycosaminoglycan attachment site.

amino acids 436-439

N-glycosylation site. 

amino acids 82-85, 179-183, 237-240, 372-375 and 423-426 

VWFC domain 

amino acids 411-425 
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FIGURE 383 

TTCGTGACCCTTGAGAAAAGAGTTGGTGGTAAATGTGCCACGTCTTCTAAGAAGGGGGAGTCCTGAACTTGTCTG 
AAGCCCTTGTCCGTAAGCCTTGAACTACGTTCTTAAATCTATGAAGTCGAGGGACCTTTCGCTGCTTTTGTAGGG 
ACTTCTTTCCTTGCTTCAGCAACATGAGGCTTTTCTTGTGGAACGCGGTCTTGACTCTGTTCGTCACTTCTTTGA 
TTGGGGCTTTGATCCCTGAACCAGAAGTGAAAATTGAAGTTCTCCAGAAGCCATTCATCTGCCATCGCAAGACCA 
AAGGAGGGGATTTGATGTTGGTCCACTATGAAGGCTACTTAGAAAAGGACGGCTCCTTATTTCACTCCACTCACA 
AACATAACAATGGTCAGCCCATTTGGTTTACCCTGGGCATCCTGGAGGCTCTCAAAGGTTGGGACCAGGGCTTGA 
AAGGAATGTGTGTAGGAGAGAAGAGAAAGCTCATCATTCCTCCTGCTCTGGGCTATGGAAAAGAAGGAAAAGGTA 
AAATTCCCCCAGAAAGTACACTGATATTTAATATTGATCTCCTGGAGATTCGAAATGGACCAAGATCCCATGAAT 
CATTCCAAGAAATGGATCTTAATGATGACTGGAAACTCTCTAAAGATGAGGTTAAAGCATATTTAAAGAAGGAGT 
TTGAAAAACATGGTGCGGTGGTGAATGAAAGTCATCATGATGCTTTGGTGGAGGATATTTTTGATAAAGAAGATG 
AAGACAAAGATGGGTTTATATCTGCCAGAGAATTTACATATAAACACGATGAGTTATAGAGATACATCTACCCTT
TTAATATAGCACTCATCTTTCAAGAGAGGGCAGTCATCTTTAAAGAACATTTTATTTTTATACAATGTTCTTTCT 
TGCTTTGTTTTTTATTTTTATATATTTTTTCTGACTCCTATTTAAAGAACCCCTTAGGTTTCTAAGTACCCATTT 
CTTTCTGATAAGTTATTGGGAAGAAAAAGCTAATTGGTCTTTGAATAGAAGACTTCTGGACAATTTTTCACTTTC 
ACAGATATGAAGCTTTGTTTTACTTTCTCACTTATAAATTTAAAATGTTGCAACTGGGAATATACCACGACATGA 
GACCAGGTTATAGCACAAATTAGCACCCTATATTTCTGCTTCCCTCTATTTTCTCCAAGTTAGAGGTCAACATTT 
GAAAAGCCTTTTGCAATAGCCCAAGGCTTGCTATTTTCATGTTATAATGAAATAGTTTATGTGTAACTGGCTCTG 
AGTCTCTGCTTGAGGACCAGAGGAAAATGGTTGTTGGACCTGACTTGTTAATGGCTACTGCTTTACTAAGGAGAT 
GTGCAATGCTGAAGTTAGAAACAAGGTTAATAGCCAGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGA 
GGCTGAGGCGGGCGGATCACCTGAGGTTGGGAGTTCGAGACCAGCCTGACCAACACGGAGAAACCCTATCTCTAG 
TAAAAATACAAAGTAGCCCGGCGTGGTGATGCGTGCCTGTAATCCCAGCTACCCAGGAAGGCTGAGGCGGCAGAA 
TCACTTGAACCCGAGGCCGAGGTTGCGGTAAGCCGAGATCACCTNCAGCCTGGACACTCTGTCTCGAAAAAAGAA 
AAGAACACGGTTAATACCATATNAATATGTATGCATTGAGACATGCTACCTAGGACTTAAGCTGATGAAGCTTGG 
CTCCTAGTGATTGGTGGCCTATTATGATAAATAGGACAAATCATTTATGTGTGAGTTTCTTTGTAATAAAATGTA 
TCAATATGTTATAGATGAGGTAGAAAGTTATATTTATATTCAATATTTACTTCTTAAGGCTAGCGGAATATCCTT 
CCTGGTTCTTTAATGGGTAGTCTATAGTATATTATACTACAATAACATTGTATCATAAGATAAAGTAGTAAACCA
GTCTACATTTTCCCATTTCTGTCTCATCAAAAACTGAAGTTAGCTGGGTGTGGTGGCTCATGCCTGTAATCCCAG 
CACTTTGGGGGCCAAGGAGGGTGGATCACTTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCT 
TGTCTCTACTAAAAATACAAAAATTAGCCAGGCGTGGTGGTGCACACCTGTAGTCCCAGCTACTCGGGAGGCTGA 
GACAGGAGATTTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGATTGTGCCACTGCACTCCAGCCTGGG 
TGACAGAGCAAGACTCCATCTCAAAAT^AAAAAAAAAGAAGCAGACCTACAGCAGCTACTATTGAATAAATACCTA 

TCCTGGATTTT 
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FIGURE 384 

MRLFLWNAVLTLFVTSLIGALIPEPEVKIEVLQKPFICHRKTKGGDLMLVHYEGYLEKDGSLF 
HSTHKHNNGQPIWFTLGILEALKGWDQGLKGMCVGEKRKLIIPPALGYGKEGKGKIPPESTLI 
FNIDLLEIRNGPRSHESFQEMDLNDDWKLSKDEVKAYLKKEFEKHGAVVNESHHDALVEDIFD
KEDEDKDGFISAREFTYKHDEL 

Important features:
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 176-179 

Casein kinase II phosphorylation site. 

amino acids 143-146, 156-159, 178-181 and 200-203 

Endoplasmic reticulum targeting sequence.

amino acids 208-211 

FKBP-type peptidyl-prolyl cis-trans isomerase

amino acids 78-114 and 118-131 

EF-hand calcium-binding domain. 

amino acids 191-203, 184-203 and 140-159 

S-100/ICaBP type calcium binding domain

amino acids 183-203 
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FIGURE 385 

CTCCCACGGTGTCCAGCGCCCAGAATGCGGCTTCTGGTCCTGCTATGGGGTTGCCTGCTGCTC 
CCAGGTTATGAAGCCCTGGAGGGCCCAGAGGAAATCAGCGGGTTCGAAGGGGACACTGTGTCC 
CTGCAGTGCACCTACAGGGAAGAGCTGAGGGACCACCGGAAGTACTGGTGCAGGAAGGGTGGG 
ATCCTCTTCTCTCGCTGCTCTGGCACCATCTATGCAGAAGAAGAAGGCCAGGAGACAATGAAG 
GGCAGGGTGTCCATCCGTGACAGCCGCCAGGAGCTCTCGCTCATTGTGACCCTGTGGAACCTC 
ACCCTGCAAGACGCTGGGGAGTACTGGTGTGGGGTCGAAAAACGGGGCCCCGATGAGTCTTTA 
CTGATCTCTCTGTTCGTCTTTCCAGGACCCTGCTGTCCTCCCTCCCCTTCTCCCACCTTCCAG
CCTCTGGCTACAACACGCCTGCAGCCCAAGGCAAAAGCTCAGCAAACCCAGCCCCCAGGATTG 
ACTTCTCCTGGGCTCTACCCGGCAGCCACCACAGCCAAGCAGGGGAAGACAGGGGCTGAGGCC 
CCTCCATTGCCAGGGACTTCCCAGTACGGGCACGAAAGGACTTCTCAGTACACAGGAACCTCT 
CCTCACCCAGCGACCTCTCCTCCTGCAGGGAGCTCCCGCCCCCCCATGCAGCTGGACTCCACC 
TCAGCAGAGGACACCAGTCCAGCTCTCAGCAGTGGCAGCTCTAAGCCCAGGGTGTCCATCCCG 
ATGGTCCGCATACTGGCCCCAGTCCTGGTGCTGCTGAGCCTTCTGTCAGCCGCAGGCCTGATC 
GCCTTCTGCAGCCACCTGCTCCTGTGGAGAAAGGAAGCTCAACAGGCCACGGAGACACAGAGG 
AACGAGAAGTTCTGGCTCTCACGCTTGACTGCGGAGGAAAAGGAAGCCCCTTCCCAGGCCCCT 
GAGGGGGACGTGATCTCGATGCCTCCCCTCCACACATCTGAGGAGGAGCTGGGCTTCTCGAAG 
TTTGTCTCAGCG TAG GGCAGGAGGCCCTCCTGGCCAGGCCAGCAGTGAAGCAGTATGGCTGGC 
TGGATCAGCACCGATTCCCGAAAGCTTTCCACCTCAGCCTCAGAGTCCAGCTGCCCGGACTCC 
AGGGCTCTCCCCACCCTCCCCAGGCTCTCCTCTTGCATGTTCCAGCCTGACCTAGAAGCGTTT 
GTCAGCCCTGGAGCCCAGAGCGGTGGCCTTGCTCTTCCGGCTGGAGACTGGGACATCCCTGAT 
AGGTTCACATCCCTGGGCAGAGTACCAGGCTGCTGACCCTCAGCAGGGCCAGACAAGGCTCAG 
TGGATCTGGTCTGAGTTTCAATCTGCCAGGAACTCCTGGGCCTCATGCCCAGTGTCGGACCCT 
GCCTTCCTCCCACTCCAGACCCCACCTTGTCTTCCCTCCCTGGCGTCCTCAGACTTAGTCCCA 
CGGTCTCCTGCATCAGCTGGTGATGAAGAGGAGCATGCTGGGGTGAGACTGGGATTCTGGCTT 
CTCTTTGAACCACCTGCATCCAGCCCTTCAGGAAGCCTGTGAAAAACGTGATTCCTGGCCCCA 
CCAAGACCCACCAAAACCATCTCTGGGCTTGGTGCAGGACTCTGAATTCTAACAATGCCCAGT
GACTGTCGCACTTGAGTTTGAGGGCCAGTGGGCCTGATGAACGCTCACACCCCTTCAGGTTAG 
AGTCTGCATTTGGGCTGTGACGTCTCCACCTGCCCCAATAGATCTGCTCTGTCTGCGACACCA 
GATCCACGTGGGGACTCCCCTGAGGCCTGCTAAGTCCAGGCCTTGGTCAGGTCAGGTGCACAT 
TGCAGGATAAGCCCAGGACCGGCACAGAAGTGGTTGCCTTTNCCATTTGCCCTCCCTGGNCGA 
TGCCTTCTTGCCTTTGGAAAAAATGATGAAGAAAACCTTGGCTCCTTCCTTGTCTGGAAAGGG 
TTACTTGCCTATGGGTTCTGGTGGCTAGAGAGAAAAGTAGAAAACCAGAGTGCACGTAGGTGT 
CTAACACAGAGGAGAGTAGGAACAGGGCGGATACCTGAAGGTGACTCCGAGTCCAGCCCCCTG 
GAGAAGGGGTCGGGGGTGGTGGTAAAGTAGCACAACTACTATTTTTTTTCTTTTTCCATTATT 
ATTGTTTTTTAAGACAGAATCTCGTGCTGCTGCCCAGGCTGGAGTGCAGTGGCACGATCTGCA 
AACTCCGCCTCCTGGGTTCAAGTGATTCTTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAG 
GCACGCACCACCACACCTGGCTAATTTTTGTACTTTTAGTAGAGATGGGGTTTCACCATGTTG 
GCCAGGCTGGTCTTGAACTCCTGACCTCAAATGAGCCTCCTGCTTCAGTCTCCCAAATTGCCG
GGATTACAGGCATGAGCCACTGTGTCTGGCCCTATTTCCTTTAAAAAGTGAAATTAAGAGTTG 
TTCAGTATGCAAAACTTGGAAAGATGGAGGAGAAAAAGAAAAGGAAGAAAAAAATGTCACCCA
TAGTCTCACCAGAGACTATCATTATTTCGTTTTGTTGTACTTCCTTCCACTCTTTTCTTCTTC 
ACATAATTTGCCGGTGTTCTTTTTACAGAGCAATTATCTTGTATATACAACTTTGTATCCTGC 
CTTTTCCACCTTATCGTTCCATCACTTTATTCCAGCACTTCTCTGTGTTTTACAGACCTTTTT 
ATAAATAAAATGTTCATCAGCTGCATAAAAAAAAAAAAAA 
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FIGURE 386 

MRLLVLLWGCLLLPGYEALEGPEEISGFEGDTVSLQCTYREELRDHRKYWCRKGGILFSRCSG 
TIYAEEEGQETMKGRVSIRDSRQELSLIVTLWNLTLQDAGEYWCGVEKRGPDESLLISLFVFP 
GPCCPPSPSPTFQPLATTRLQPKAKAQQTQPPGLTSPGLYPAATTAKQGKTGAEAPPLPGTSQ 
YGHERTSQYTGTSPHPATSPPAGSSRPPMQLDSTSAEDTSPALSSGSSKPRVSIPMVRILAPV 
LVLLSLLSAAGLIAFCSHLLLWRKEAQQATETQRNEKFWLSRLTAEEKEAPSQAPEGDVISMP 
PLHTSEEELGFSKFVSA 

Important features: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 248-269 

N-glycosylation site. 

amino acids 96-99 

Fibrinogen beta and gamma chains C-terminal domain.

amino acids 104-113 

Ig like V-type domain: 

amino acids 13-128 
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FIGURE 387 

GCGCCGGGAGCCCATCTGCCCCCAGGGGCACGGGGCGCGGGGCCGGCTCCCGCCCGGCACATG 
GCTGCAGCCACCTCGCGCGCACCCCGAGGCGCCGCGCCCAGCTCGCCCGAGGTCCGTCGGAGG 
CGCCCGGCCGCCCCGGAGCCAAGCAGCAACTGAGCGGGGAAGCGCCCGCGTCCGGGGATCGGG 
ATG TCCCTCCTCCTTCTCCTCTTGCTAGTTTCCTACTATGTTGGAACCTTGGGGACTCACACT 
GAGATCAAGAGAGTGGCAGAGGAAAAGGTCACTTTGCCCTGCCACCATCAACTGGGGCTTCCA 
GAAAAAGACACTCTGGATATTGAATGGCTGCTCACCGATAATGAAGGGAACCAAAAAGTGGTG 
ATCACTTACTCCAGTCGTCATGTCTACAATAACTTGACTGAGGAACAGAAGGGCCGAGTGGCC 
TTTGCTTCCAATTTCCTGGCAGGAGATGCCTCCTTGCAGATTGAACCTCTGAAGCCCAGTGAT 
GAGGGCCGGTACACCTGTAAGGTTAAGAATTCAGGGCGCTACGTGTGGAGCCATGTCATCTTA 
AAAGTCTTAGTGAGACCATCCAAGCCCAAGTGTGAGTTGGAAGGAGAGCTGACAGAAGGAAGT 
GACCTGACTTTGCAGTGTGAGTCATCCTCTGGCACAGAGCCCATTGTGTATTACTGGCAGCGA 
ATCCGAGAGAAAGAGGGAGAGGATGAACGTCTGCCTCCCAAATCTAGGATTGACTACAACCAC 
CCTGGACGAGTTCTGCTGCAGAATCTTACCATGTCCTACTCTGGACTGTACCAGTGCACAGCA 
GGCAACGAAGCTGGGAAGGAAAGCTGTGTGGTGCGAGTAACTGTACAGTATGTACAAAGCATC 
GGCATGGTTGCAGGAGCAGTGACAGGCATAGTGGCTGGAGCCCTGCTGATTTTCCTCTTGGTG 
TGGCTGCTAATCCGAAGGAAAGACAAAGAAAGATATGAGGAAGAAGAGAGACCTAATGAAATT 
CGAGAAGATGCTGAAGCTCCAAAAGCCCGTCTTGTGAAACCCAGCTCCTCTTCCTCAGGCTCT 
CGGAGCTCACGCTCTGGTTCTTCCTCCACTCGCTCCACAGCAAATAGTGCCTCACGCAGCCAG 
CGGACACTGTCAACTGACGCAGCACCCCAGCCAGGGCTGGCCACCCAGGCATACAGCCTAGTG 
GGGCCAGAGGTGAGAGGTTCTGAACCAAAGAAAGTCCACCATGCTAATCTGACCAAAGCAGAA 
ACCACACCCAGCATGATCCCCAGCCAGAGCAGAGCCTTCCAAACGGTCTGAATTACAATGGAC 
TTGACTCCCACGCTTTCCTAGGAGTCAGGGTCTTTGGACTCTTCTCGTCATTGGAGCTCAAGT 
CACCAGCCACACAACCAGATGAGAGGTCATCTAAGTAGCAGTGAGCATTGCACGGAACAGATT 
CAGATGAGCATTTTCCTTATACAATACCAAACAAGCAAAAGGATGTAAGCTGATTCATCTGTA 
AAAAGGCATCTTATTGTGCCTTTAGACCAGAGTAAGGGAAAGCAGGAGTCCAAATCTATTTGT 
TGACCAGGACCTGTGGTGAGAAGGTTGGGGAAAGGTGAGGTGAATATACCTAAAACTTTTAAT 
GTGGGATATTTTGTATCAGTGCTTTGATTCACAATTTTCAAGAGGAAATGGGATGCTGTTTGT 
AAATTTTCTATGCATTTCTGCAAACTTATTGGATTATTAGTTATTCAGACAGTCAAGCAGAAC 
CCACAGCCTTATTACACCTGTCTACACCATGTACTGAGCTAACCACTTCTAAGAAACTCCAAA 
AAAGGAAACATGTGTCTTCTATTCTGACTTAACTTCATTTGTCATAAGGTTTGGATATTAATT
TCAAGGGGAGTTGAAATAGTGGGAGATGGAGAAGAGTGAATGAGTTTCTCCCACTCTATACTA 
ATCTCACTATTTGTATTGAGCCCAAAATAACTATGAAAGGAGACAAAAATTTGTGACAAAGGA 
TTGTGAAGAGCTTTCCATCTTCATGATGTTATGAGGATTGTTGACAAACATTAGAAATATATA 
ATGGAGCAATTGTGGATTTCCCCTCAAATCAGATGCCTCTAAGGACTTTCCTGCTAGATATTT 
CTGGAAGGAGAAAATACAACATGTCATTTATCAACGTCCTTAGAAAGAATTCTTCTAGAGAAA 
AAGGGATCTAGGAATGCTGAAAGATTACCCAACATACCATTATAGTCTCTTCTTTCTGAGAAA 
ATGTGAAACCAGAATTGCAAGACTGGGTGGACTAGAAAGGGAGATTAGATCAGTTTTCTCTTA 
ATATGTCAAGGAAGGTAGCCGGGCATGGTGCCAGGCACCTGTAGGAAAATCCAGCAGGTGGAG 
GTTGCAGTGAGCCGAGATTATGCCATTGCACTCCAGCCTGGGTGACAGAGCGGGACTCCGTCTC 
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FIGURE 388 

MSLLLLLLLVSYYVGTLGTHTEIKRVAEEKVTLPCHHQLGLPEKDTLDIEWLLTDNEGNQKVV
ITYSSRHVYNNLTEEQKGRVAFASNFLAGDASLQIEPLKPSDEGRYTCKVKNSGRYVWSHVIL 
KVLVRPSKPKCELEGELTEGSDLTLQCESSSGTEPIVYYWQRIREKEGEDERLPPKSRIDYNH
PGRVLLQNLTMSYSGLYQCTAGNEAGKESCVVRVTVQYVQSIGMVAGAVTGIVAGALLIFLLV 
WLLIRRKDKERYEEEERPNEIREDAEAPKARLVKPSSSSSGSRSSRSGSSSTRSTANSASRSQ 
RTLSTDAAPQPGLATQAYSLVGPEVRGSEPKKVHHANLTKAETTPSMIPSQSRAFQTV 

Important features:
Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 232-251 
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FIGURE 389 

GCGGCACCTGGAAGATGCGCCCATTGGCTGGTGGCCTGCTCAAGGTGGTGTTCGTGGTCTTCG
CCTCCTTGTGTGCCTGGTATTCGGGGTACCTGCTCGCAGAGCTCATTCCAGATGCACCCCTGT 
CCAGTGCTGCCTATAGCATCCGCAGCATCGGGGAGAGGCCTGTCCTCAAAGCTCCAGTCCCCA 
AAAGGCAAAAATGTGACCACTGGACTCCCTGCCCATCTGACACCTATGCCTACAGGTTACTCA 
GCGGAGGTGGCAGAAGCAAGTACGCCAAAATCTGCTTTGAGGATAACCTACTTATGGGAGAAC 
AGCTGGGAAATGTTGCCAGAGGAATAAACATTGCCATTGTCAACTATGTAACTGGGAATGTGA 
CAGCAACACGATGTTTTGATATGTATGAAGGCGATAACTCTGGACCGATGACAAAGTTTATTC 
AGAGTGCTGCTCCAAAATCCCTGCTCTTCATGGTGACCTATGACGACGGAAGCACAAGACTGA
ATAACGATGCCAAGAATGCCATAGAAGCACTTGGAAGTAAAGAAATCAGGAACATGAAATTCA 
GGTCTAGCTGGGTATTTATTGCAGCAAAAGGCTTGGAACTCCCTTCCGAAATTCAGAGAGAAA 
AGATCAACCACTCTGATGCTAAGAACAACAGATATTCTGGCTGGCCTGCAGAGATCCAGATAG 
AAGGCTGCATACCCAAAGAACGAAGCTGACACTGCAGGGTCCTGAGTAAATGTGTTCTGTATA 
AACAAATGCAGCTGGAATCGCTCAAGAATCTTATTTTTCTAAATCCAACAGCCCATATTTGAT 
GAGTATTTTGGGTTTGTTGTAAACCAATGAACATTTGCTAGTTGTATCAAATCTTGGTACGCA 
GTATTTTTATACCAGTATTTTATGTAGTGAAGATGTCAATTAGCAGGAAACTAAAATGAATGG
AAATTCTTAAAAAAAAAA
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FIGURE 390 

MRPLAGGLLKVVFVVFASLCAWYSGYLLAELIPDAPLSSAAYSIRSIGERPVLKAPVPKRQKC
DHWTPCPSDTYAYRLLSGGGRSKYAKICFEDNLLMGEQLGNVARGINIAIVNYVTGNVTATRC 
FDMYEGDNSGPMTKFIQSAAPKSLLFMVTYDDGSTRLNNDAKNAIEALGSKEIRNMKFRSSWV 
FIAAKGLELPSEIQREKINHSDAKNNRYSGWPAEIQIEGCIPKERS 

Important features: 
Signal sequence. 

amino acids 1-20 

N-glycosylation sites.

amino acids 120-124, 208-212 

Glycosaminoglycan attachment site.

amino acids 80-84 

N-myristoylation sites. 

amino acids 81-87, 108-114, 119-125 
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FIGURE 391 

GGGGGCTTTCTTGGGCTTGGCTGCTTGGAACACCTGCCTCCAAGGACCGGCCTCGGAGGGGTCGCCGGGAAAGGG 

AGGGAAGAAGGAAGGGCGGGGCCGGCCCCCCTGCGCCCGCCCCGCGCCTCTGCGCGCCCCTGTCCGCCCCGGCCC 

AGCCCAGCCCAGCCCCGCGGGCCGGTCACACGCGCAGCCAGCCGGCCGCCTCCCGCGCCCAAGCGCGCCGCTCTG 

CTGTGCCCTGCGCCCTTGCCCCGCGCCAGCTTCTGCGCCCGCAGCCCGCCCGGCGCCCCCGGTGACCGTGACCCT 

GCCCTGGGCGCGGGGCGGAGCAGGCATGTCCCGCCCGGGGACCGCTACCCCAGCGCTGGCCCTGGTGCTCCTGGC

AGTGACCCTGGCCGGGGTCGGAGCCCAGGGCGCAGCCCTCGAGGACCCTGATTATTACGGGCAGGAGATCTGGAG 

CCGGGAGCCCTACTACGCGCGCCCGGAGCCCGAGCTCGAGACCTTCTCTCCGCCGCTGCCTGCGGGGCCCGGGGA

GGAGTGGGAGCGGCGCCCGCAGGAGCCCAGGCCGCCCAAGAGGGCCACCAAGCCCAAGAAAGCTCCCAAGAGGGA 

GAAGTCGGCTCCGGAGCCGCCTCCACCAGGTAAACACAGCAACAAAAAAGTTATGAGAACCAAGAGCTCTGAGAA

GGCTGCCAACGATGATCACAGTGTCCGTGTGGCCCGTGAAGATGTCAGAGAGAGTTGCCCACCTCTTGGTCTGGA 

AACCTTAAAAATCACAGACTTCCAGCTCCATGCCTCCACGGTGAAGCGCTATGGCCTGGGGGCACATCGAGGGAG 

ACTCAACATCCAGGCGGGCATTAATGAAAATGATTTTTATGACGGAGCGTGGTGCGCGGGAAGAAATGACCTCCA 

GCAGTGGATTGAAGTGGATGCTCGGCGCCTGACCAGATTCACTGGTGTCATCACTCAAGGGAGGAACTCCCTCTG 

GCTGAGTGACTGGGTGACATCCTATAAGGTCATGGTGAGCAATGACAGCCACACGTGGGTCACTGTTAAGAATGG 

ATCTGGAGACATGATATTTGAGGGAAACAGTGAGAAGGAGATCCCTGTTCTCAATGAGCTACCCGTCCCCATGGT 

GGCCCGCTACATCCGCATAAACCCTCAGTCCTGGTTTGATAATGGGAGCATCTGCATGAGAATGGAGATCCTGGG 

CTGCCCACTGCCAGATCCTAATAATTATTATCACCGCCGGAACGAGATGACCACCACTGATGACCTGGATTTTAA 

GCACCACAATTATAAGGAAATGCGCCAGTTGATGAAAGTTGTGAATGAAATGTGTCCCAATATCACCAGAATTTA 

CAACATTGGAAAAAGCCACCAGGGCCTGAAGCTGTATGCTGTGGAGATCTCAGATCACCCTGGGGAGCATGAAGT 

CGGTGAGCCCGAGTTCCACTACATCGCGGGGGCCCACGGCAATGAGGTGCTGGGCCGGGAGCTGCTGCTGCTGCT 

GGTGCAGTTCGTGTGTCAGGAGTACTTGGCCCGGAATGCGCGCATCGTCCACCTGGTGGAGGAGACGCGGATTCA 

CGTCCTCCCCTCCCTCAACCCCGATGGCTACGAGAAGGCCTACGAAGGGGGCTCGGAGCTGGGAGGCTGGTCCCT 

GGGACGCTGGACCCACGATGGAATTGACATCAACAACAACTTTCCTGATTTAAACACGCTGCTCTGGGAGGCAGA 

GGATCGACAGAATGTCCCCAGGAAAGTTCCCAATCACTATATTGCAATCCCTGAGTGGTTTCTGTCGGAAAATGC

CACGGTGGCTGCCGAGACCAGAGCAGTCATAGCCTGGATGGAAAAAATCCCTTTTGTGCTGGGCGGCAACCTGCA 

GGGCGGCGAGCTGGTGGTGGCGTATCCCTACGACCTGGTGCGGTCCCCCTGGAAGACGCAGGAACACACCCCCAC 

CCCCGATGACCACGTGTTCCGCTGGCTGGCCTACTCCTATGCCTCCACACACCGCCTCATGACAGACGCCCGGAG 

GAGGGTGTGCCACACGGAGGACTTCCAGAAGGAGGAGGGCACTGTCAATGGGGCCTCCTGGCACACCGTCGCTGG 

AAGTCTGAACGATTTCAGCTACCTTCATACAAACTGCTTCGAACTGTCCATCTACGTGGGCTGTGATAAATACCC 

ACATGAGAGCCAGCTGCCCGAGGAGTGGGAGAATAACCGGGAATCTCTGATCGTGTTCATGGAGCAGGTTCATCG 

TGGCATTAAAGGCTTGGTGAGAGATTCACATGGAAAAGGAATCCCAAACGCCATTATCTCCGTAGAAGGCATTAA 

CCATGACATCCGAACAGCCAACGATGGGGATTACTGGCGCCTCCTGAACCCTGGAGAGTATGTGGTCACAGCAAA

GGCCGAAGGTTTCAC1PGCATCCACCAAGAACTGTATGGTTGGCTATGACATGGGGGCCACAAGGTGTGACTTCAC 

ACTTAGCAAAACCAACATGGCCAGGATCCGAGAGATCATGGAGAAGTTTGGGAAGCAGCCCGTCAGCCTGCCAGC 

CAGGCGGCTGAAGCTGCGGGGGCGGAAGAGACGACAGCGTGGGTGACCCTCCTGGGCCCTTGAGACTCGTCTGGG 

ACCCATGCAAATTAAACCAACCTGGTAGTAGCTCCATAGTGGACTCACTCACTGTTGTTTCCTCTGTAATTCAAG 

AAGTGCCTGGAAGAGAGGGTGCATTGTGAGGCAGGTCCCAAAAGGGAAGGCTGGAGGCTGAGGCTGTTTTCTTTT 

CTTTGTTCCCATTTATCCAAATATCTTGGACAGAGCAGCAGAGAAAAGCTGATGGGAGTGAGAGAACTCAGCAAG

CCAACCTGGGAATCAGAGAGAGAAGGAGAAGGAGGGGAGCCTGTCCGTTCAGAGCCTCTGGCTGCATAGAAAAGG 

ATTCTGGTGCTTCCCCTGTTTGCGTGGCAGCAAGGGTTCCACGTGCATTTGCAATTTGCACAGCTAAAATTGCAG 

CATTTCCCCAGCTGGGCTGTCCCAAATGTTACCATTTGAGATGCTCCCAGGCGTCCTAAGAGAATCCACCCTCTC 

TGGCCCTGGGACATTGCAAGCTGCTACAAATAAATTCTGTGTTCTTTTGACAATAGCGTCATTGCCAAGTGCACA

TCAGTGAGCCTCTTGAATCTGTTTAGTCTCCTTTTTCAACAAAGGAGTGTGTTCAGAAAAGGAGAGAGAGGCTGA 

GATCATTCAGGAGTTTGTTGGGCAGCAAGCATGGAGCTTCTTGCACAAATTCTGGGTCCATAAACAACCCCCAAA 

GTCCCTGCTGATCCAGTAGCCCTGGAGGTTCCCCAGGTAGGGAGAGCCAGAGGTGCCAGCCTTCCTGAAGGGCCA 

GAAAATTTAGCCTGGATCTCCTCTTTTACCTGCTAGGACTGGAAAGAGCCAGAAGTGGGGTGGCCTGAAGCCCTC 

TCTCTGCTTGAGGTATTGCCCCTGTGTGGAATTGAGTGCTCATGGGTTGGCCTCATATCAGCCTGGGAGTTATTT 

TTGATATGTAGAATGCCAGATCTTCCAGATTAGGCTAAATGTAATGAAAACCTCTTAGGATTATCTGTGGAGCAT 

CAGTTTGGGAAGAATTATTGAATTATCTTGCAAGAAAAAAGTATGTCTCACTTTTTGTTAATGTTGCTGCCTCAT 

TGACCTGGGAAAAATGAAAAAAAAAAATAAAGCAAATGGTAAGACCCTTAAAAAAAAAAAAAAAAAAAAAAAAAA
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FIGURE 392 

MSRPGTATPALALVLLAVTLAGVGAQGAALEDPDYYGQEIWSREPYYARPEPELETFSPPLPA 
GPGEEWERRPQEPRPPKRATKPKKAPKREKSAPEPPPPGKHSNKKVMRTKSSEKAANDDHSVR 
VAREDVRESCPPLGLETLKITDFQLHASTVKRYGLGAHRGRLNIQAGINENDFYDGAWCAGRN 
DLQQWIEVDARRLTRFTGVITQGRNSLWLSDWVTSYKVMVSNDSHTWVTVKNGSGDMIFEGNS 
EKEIPVLNELPVPMVARYIRINPQSWFDNGSICMRMEILGCPLPDPNNYYHRRNEMTTTDDLD
FKHHNYKEMRQLMKVVNEMCPNITRIYNIGKSHQGLKLYAVEISDHPGEHEVGEPEFHYIAGA
HGNEVLGRELLLLLVQFVCQEYLARNARIVHLVEETRIHVLPSLNPDGYEKAYEGGSELGGWS 
LGRWTHDGIDINNNFPDLNTLLWEAEDRQNVPRKVPNHYIAIPEWFLSENATVAAETRAVIAW 
MEKIPFVLGGNLQGGELVVAYPYDLVRSPWKTQEHTPTPDDHVFRWLAYSYASTHRLMTDARR
RVCHTEDFQKEEGTVNGASWHTVAGSLNDFSYLHTNCFELSIYVGCDKYPHESQLPEEWENNR 
ESLIVFMEQVHRGIKGLVRDSHGKGIPNAIISVEGINHDIRTANDGDYWRLLNPGEYVVTAKA
EGFTASTKNCMVGYDMGATRCDFTLSKTNMARIREIMEKFGKQPVSLPARRLKLRGRKRRQRG
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FIGURE 393 

GTCCCACATCCTGCTCAACTGGGTCAGGTCCCTCTTAGACCAGCTCTTGTCCATCATTTGCTGAAGTGGACCAAC 
TAGTTCCCCAGTAGGGGGTCTCCCCTGGCAATTCTTGATCGGCGTTTGGACATCTCAGATCGCTTCCAATGAAGA 
TGGCCTTGCCTTGGGGTCCTGCTTGTTTCATAATCATCTAACTATGGGACAAGGTTGTGCCGGCAGCTCTGGGGG 
AAGGAGCACGGGGCTGATCAAGCCATCCAGGAAACACTGGAGGACTTGTCCAGCCTTGAAAGAACTCTAGTGGTT 
TCTGAATCTAGCCCACTTGGCGGTAAGCATGATGCAACTTCTGCAACTTCTGCTGGGGCTTTTGGGGCCAGGTGG 
CTACTTATTTCTTTTAGGGGATTGTCAGGAGGTGACCACTCTCACGGTGAAATACCAAGTGTCAGAGGAAGTGCC 
ATCTGGTACAGTGATCGGGAAGCTGTCCCAGGAACTGGGCCGGGAGGAGAGGCGGAGGCAAGCTGGGGCCGCCTT 
CCAGGTGTTGCAGCTGCCTCAGGCGCTCCCCATTCAGGTGGACTCTGAGGAAGGCTTGCTCAGCACAGGCAGGCG 
GCTGGATCGAGAGCAGCTGTGCCGACAGTGGGATCCCTGCCTGGTTTCCTTTGATGTGCTTGCCACAGGGGATTT 
GGCTCTGATCCATGTGGAGATCCAAGTGCTGGACATCAATGACCACCAGCCACGGTTTCCCAAAGGCGAGCAGGA 
GCTGGAAATCTCTGAGAGCGCCTCTCTGCGAACCCGGATCCCCCTGGACAGAGCTCTTGACCCAGAGACAGGCCC 
TAACACCCTGCACACCTACACTCTGTCTCCCAGTGAGCACTTTGCCTTGGATGTCATTGTGGGCCCTGATGAGAC 
CAAACATGCAGAACTCATAGTGGTGAAGGAGCTGGACAGGGAAATCCATTCATTTTTTGATCTGGTGTTAACTGC 
CTATGACAATGGGAACCCCCCCAAGTCAGGTACCAGCTTGGTCAAGGTCAACGTCTTGGACTCCAATGACAATAG 
CCCTGCGTTTGCTGAGAGTTCACTGGCACTGGAAATCCAAGAAGATGCTGCACCTGGTACGCTTCTCATAAAACT 
GACCGCCACAGACCCTGACCAAGGCCCCAATGGGGAGGTGGAGTTCTTCCTCAGTAAGCACATGCCTCCAGAGGT 
GCTGGACACCTTCAGTATTGATGCCAAGACAGGCCAGGTCATTCTGCGTCGACCTCTAGACTATGAAAAGAACCC 
TGCCTACGAGGTGGATGTTCAGGCAAGGGACCTGGGTCCCAATCCTATCCCAGCCCATTGCAAAGTTCTCATCAA 
GGTTCTGGATGTCAATGACAACATCCCAAGCATCCACGTCACATGGGCCTCCCAGCCATCACTGGTGTCAGAAGC 
TCTTCCCAAGGACAGTTTTATTGCTCTTGTCATGGCAGATGACTTGGATTCAGGACACAATGGTTTGGTCCACTG 
CTGGCTGAGCCAAGAGCTGGGCCACTTCAGGCTGAAAAGAACTAATGGCAACACATACATGTTGCTAACCAATGC 
CACACTGGACAGAGAGCAGTGGCCCAAATATACCCTCACTCTGTTAGCCCAAGACCAAGGACTCCAGCCCTTATC 
AGCCAAGAAACAGCTCAGCATTCAGATCAGTGACATCAACGACAATGCACCTGTGTTTGAGAAAAGCAGGTATGA 
AGTCTCCACGCGGGAAAACAACTTACCCTCTCTTCACCTCATTACCATCAAGGCTCATGATGCAGACTTGGGCAT 
TAATGGAAAAGTCTCATACCGCATCCAGGACTCCCCAGTTGCTCACTTAGTAGCTATTGACTCCAACACAGGAGA 
GGTCACTGCTCAGAGGTCACTGAACTATGAAGAGATGGCCGGCTTTGAGTTCCAGGTGATCGCAGAGGACAGCGG 
GCAACCCATGCTTGCATCCAGTGTCTCTGTGTGGGTCAGCCTCTTGGATGCCAATGATAATGCCCCAGAGGTGGT 
CCAGCCTGTGCTCAGCGATGGAAAAGCCAGCCTCTCCGTGCTTGTGAATGCCTCCACAGGCCACCTGCTGGTGCC 
CATCGAGACTCCCAATGGCTTGGGCCCAGCGGGCACTGACACACCTCCACTGGCCACTCACAGCTCCCGGCCATT 
CCTTTTGACAACCATTGTGGCAAGAGATGCAGACTCGGGGGCAAATGGAGAGCCCCTCTACAGCATCCGCAATGG 
AAATGAAGCCCACCTCTTCATCCTCAACCCTCATACGGGGCAGCTGTTCGTCAATGTCACCAATGCCAGCAGCCT
CATTGGGAGTGAGTGGGAGCTGGAGATAGTAGTAGAGGACCAGGGAAGCCCCCCCTTACAGACCCGAGCCCTGTT 
GAGGGTCATGTTTGTCACCAGTGTGGACCACCTGAGGGACTCAGCCCGCAAGCCTGGGGCCTTGAGCATGTCGAT 
GCTGACGGTGATCTGCCTGGCTGTACTGTTGGGCATCTTCGGGTTGATCCTGGCTTTGTTCATGTCCATCTGCCG 
GACAGAAAAGAAGGACAACAGGGCCTACAACTGTCGGGAGGCCGAGTCCACCTACCGCCAGCAGCCCAAGAGGCC 
CCAGAAACACATTCAGAAGGCAGACATCCACCTCGTGCCTGTGCTCAGGGGTCAGGCAGGTGAGCCTTGTGAAGT 
CGGGCAGTCCCACAAAGATGTGGACAAGGAGGCGATGATGGAAGCAGGCTGGGACCCCTGCCTGCAGGCCCCCTT 
CCACCTCACCCCGACCCTGTACAGGACGCTGCGTAATCAAGGCAACCAGGGAGCACCGGCGGAGAGCCGAGAGGT 
GCTGGAAGACACGGTCAACCTCCTTTTCAACCATCCCAGGCAGAGGAATGCCTCCCGGGAGAACCTGAACCTTCC 
CGAGCCCCAGCCTGCCACAGGCCAGCCACGTTCCAGGCCTCTGAAGGTTGCAGGCAGCCCCACAGGGAGGCTGGC 
TGGAGACCAGGGCAGTGAGGAAGCCCCACAGAGGCCACCAGCCTCCTCTGCAACCCTGAGACGGCAGCGACATCT 
CAATGGCAAAGTGTCCCCTGAGAAAGAATCAGGGCCCCGTCAGATCCTGCGGAGCCTGGTCCGGCTGTCTGTGGC 
TGCCTTCGCCGAGCGGAACCCCGTGGAGGAGCTCACTGTGGATTCTCCTCCTGTTCAGCAAATCTCCCAGCTGCT 
GTCCTTGCTGCATCAGGGCCAATTCCAGCCCAAACCAAACCACCGAGGAAATAAGTACTTGGCCAAGCCAGGAGG 
CAGCAGGAGTGCAATCCCAGACACAGATGGCCCAAGTGCAAGGGCTGGAGGCCAGACAGACCCAGAACAGGAGGA 
AGGGCCTTTGGATCCTGAAGAGGACCTCTCTGTGAAGCAACTGCTAGAAGAAGAGCTGTCAAGTCTGCTGGACCC 
CAGCACAGGTCTGGCCCTGGACCGGCTGAGCGCCCCTGACCCGGCCTGGATGGCGAGACTCTCTTTGCCCCTCAC 
CACCAACTACCGTGACAATGTGATCTCCCCGGATGCTGCAGCCACGGAGGAGCCGAGGACCTTCCAGACGTTCGG 
CAAGGCAGAGGCACCAGAGCTGAGCCCAACAGGCACGAGGCTGGCCAGCACCTTTGTCTCGGAGATGAGCTCACT 
GCTGGAGATGCTGCTGGAACAGCGCTCCAGCATGCCCGTGGAGGCCGCCTCCGAGGCGCTGCGGCGGCTCTCGGT 
CTGCGGGAGGACCCTCAGTTTAGACTTGGCCACCAGTGCAGCCTCAGGCATGAAAGTGCAAGGGGACCCAGGTGG 
AAAGACGGGGACTGAGGGCAAGAGCAGAGGCAGCAGCAGCAGCAGCAGGTGCCTGTGAACATACCTCAGACGCCT 
CTGGATCCAAGAACCAGGGGCCTGAGGATCTGTGGACAAGAGCTGGTTTCTAAAATCTTGTAACTCACTAGCTAG 
CGGCGGCCTGAGAACTTTAGGGTGACTGATGCTACCCCCACAGAGGAGGCAAGAGCCCCAGGACTAACAGCTGAC 
TGACCAAAGCAGCCCCTTGTAAGCAGCTCTGAGTCTTTTGGAGGACAGGGACGGTTTGTGGCTGAGATAAGTGTT 
TCCTGGCAAAACATATGTGGAGCACAAAGGGTCAGTCCTCTGGCAGAACAGATGCCACGGAGTATCACAGGCAGG 
AAAGGGTGGCCTTCTTGGGTAGCAGGAGTCAGGGGGCTGTACCCTGGGGGTGCCAGGAAATGCTCTCTGACCTAT 
CAATAAAGGAAAAGCAGTAAAAAAAAAAAAAAAAAAAA 
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FIGURE 394 

MMQLLQLLLGLLGPGGYLFLLGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQELGREERRRQAG 
AAFQVLQLPQALPIQVDSEEGLLSTGRRLDREQLCRQWDPCLVSFDVLATGDLALIHVEIQVL 
DINDHQPRFPKGEQELEISESASLRTRIPLDRALDPDTGPNTLHTYTLSPSEHFALDVIVGPD 
ETKHAELIVVKELDREIHSFFDLVLTAYDNGNPPKSGTSLVKVNVLDSNDNSPAFAESSLALE
IQEDAAPGTLLIKLTATDPDQGPNGEVEFFLSKHMPPEVLDTFSIDAKTGQVILRRPLDYEKN 
PAYEVDVQARDLGPNPIPAHCKVLIKVLDVNDNIPSIHVTWASQPSLVSEALPKDSFIALVMA 
DDLDSGHNGLVHCWLSQELGHFRLKRTNGNTYMLLTNATLDREQWPKYTLTLLAQDQGLQPLS 
AKKQLSIQISDINDNAPVFEKSRYEVSTRENNLPSLHLITIKAHDADLGINGKVSYRIQDSPV 
AHLVAIDSNTGEVTAQRSLNYEEMAGFEFQVIAEDSGQPMLASSVSVWVSLLDANDNAPEVVQ
PVLSDGKASLSVLVNASTGHLLVPIETPNGLGPAGTDTPPLATHSSRPFLLTTIVARDADSGA 
NGEPLYSIRNGNEAHLFILNPHTGQLFVNVTNASSLIGSEWELEIVVEDQGSPPLQTRALLRV 
MFVTSVDHLRDSARKPGALSMSMLTVICLAVLLGIFGLILALFMSICRTEKKDNRAYNCREAE 
STYRQQPKRPQKHIQKADIHLVPVLRGQAGEPCEVGQSHKDVDKEAMMEAGWDPCLQAPFHLT 
PTLYRTLRNQGNQGAPAESREVLQDTVNLLFNHPRQRNASRENLNLPEPQPATGQPRSRPLKV 
AGSPTGRLAGDQGSEEAPQRPPASSATLRRQRHLNGKVSPEKESGPRQILRSLVRLSVAAFAE 
RNPVEELTVDSPPVQQISQLLSLLHQGQFQPKPNHRGNKYLAKPGGSRSAIPDTDGPSARAGG 
QTDPEQEEGPLDPEEDLSVKQLLEEELSSLLDPSTGLALDRLSAPDPAWMARLSLPLTTNYRD 
NVISPDAAATEEPRTFQTFGKAEAPELSPTGTRLASTFVSEMSSLLEMLLEQRSSMPVEAASE 
ALRRLSVCGRTLSLDLATSAASGMKVQGDPGGKTGTEGKSRGSSSSSRCL 

Important features: 
Signal peptide: 

amino acids 1-13

Transmembrane domain:

amino acids 719-739

N-glycosylation site.

amino acids 415-418, 582-585, 659-662, 662-665 and 857-860

Cadherins extracellular repeated domain signature.

amino acids 123-133, 232-242, 340-350, 448-458 and 553-563
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FIGURE 395 

CCCAGGCTCTAGTGCAGGAGGAGAAGGAGGAGGAGCAGGAGGTGGAGATTCCCAGTTAAAAGG 
CTCCAGAATCGTGTACCAGGCAGAGAACTGAAGTACTGGGGCCTCCTCCACTGGGTCCGAATC 
AGTAGGTGACCCCGCCCCTGGATTCTGGAAGACCTCAGCATGGGACGCCCCCGACCTCGTGCG 
GGCAAGACGTGGATGTTCCTGCTCTTGCTGGGGGGAGCCTGGGCAGGACACTCCAGGGCACAG 
GAGGACAAGGTGCTGGGGGGTCATGAGTGCCAACCCCATTCGCAGCCTTGGCAGGCGGCCTTG 
TTCCAGGGCCAGCAACTACTCTGTGGCGGTGTCCTTGTAGGTGGCAACTGGGTCCTTACAGCT 
GCCCACTGTAAAAAACCGAAATACACAGTACGCCTGGGAGACCACAGCCTACAGAATAAAGAT 
GGCCCAGAGCAAGAAATACCTGTGGTTCAGTCCATCCCACACCCCTGCTACAACAGCAGCGAT 
GTGGAGGACCACAACCATGATCTGATGCTTCTTCAACTGCGTGACCAGGCATCCCTGGGGTCC 
AAAGTGAAGCCCATCAGCCTGGCAGATCATTGCACCCAGCCTGGCCAGAAGTGCACCGTCTCA 
GGCTGGGGCACTGTCACCAGTCCCCGAGAGAATTTTCCTGACACTCTCAACTGTGCAGAAGTA 
AAAATCTTTCCCCAGAAGAAGTGTGAGGATGCTTACCCGGGGCAGATCACAGATGGCATGGTC 
TGTGCAGGCAGCAGCAAAGGGGCTGACACGTGCCAGGGCGATTCTGGAGGCCCCCTGGTGTGT 
GATGGTGCACTCCAGGGCATCACATCCTGGGGCTCAGACCCCTGTGGGAGGTCCGACAAACCT 
GGCGTCTATACCAACATCTGCCGCTACCTGGACTGGATCAAGAAGATCATAGGCAGCAAGGGC 
TGATTCTAGGATAAGCACTAGATCTCCCTTAATAAACTCACAACTCTCTGGTTC 
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FIGURE 396 

MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAALFQGQQLLCGGVLV 
GGNWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQEIPVVQSIPHPCYNSSDVEDHNHDLMLLQL 
RDQASLGSKVKPISLADHCTQPGQKCTVSGWGTVTSPRENFPDTLNCAEVKIFPQKKCEDAYP
GQITDGMVCAGSSKGADTCQGDSGGPLVCDGALQGITSWGSDPCGRSDKPGVYTNICRYLDWI 
KKIIGSKG 

Important features:
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 51-71 

N-glycosylation site. 

amino acids 110-113 

Serine proteases, trypsin family, histidine active site. 

amino acids 69-74 and 207-217

Tyrosine kinase phosphorylation site. 

amino acids 182-188 



Kringle domain proteins motif 

amino acids 205-217 
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FIGURE 397 

GGCGGCTGCTGAGCTGCCTTGAGGTGCAGTGTTGGGGATCCAGAGCCATGTCGGACCTGCTAC 
TACTGGGCCTGATTGGGGGCCTGACTCTCTTACTGCTGCTGACGCTGCTGGCCTTTGCCGGGT 
ACTCAGGGCTACTGGCTGGGGTGGAAGTGAGTGCTGGGTCACCCCCCATCCGCAACGTCACTG 
TGGCCTACAAGTTCCACATGGGGCTCTATGGTGAGACTGGGCGGCTTTTCACTGAGAGCTGCA 
GCATCTCTCCCAAGCTCCGCTCCATCGCTGTCTACTATGACAACCCCCACATGGTGCCCCCTG 
ATAAGTGCCGATGTGCCGTGGGCAGCATCCTGAGTGAAGGTGAGGAATCGCCCTCCCCTGAGC 
TCATCGACCTCTACCAGAAATTTGGCTTCAAGGTGTTCTCCTTCCCGGCACCCAGCCATGTGG 
TGACAGCCACCTTCCCCTACACCACCATTCTGTCCATCTGGCTGGCTACCCGCCGTGTCCATC 
CTGCCTTGGACACCTACATCAAGGAGCGGAAGCTGTGTGCCTATCCTCGGCTGGAGATCTACC 
AGGAAGACCAGATCCATTTCATGTGCCCACTGGCACGGCAGGGAGACTTCTATGTGCCTGAGA 
TGAAGGAGACAGAGTGGAAATGGCGGGGGCTTGT(SGAGGCCATTGACACCCAGGTGGATGGCA 
CAGGAGCTGACACAATGAGTGACACGAGTTCTGTAAGCTTGGAAGTGAGCCCTGGCAGCCGGG 
AGACTTCAGCTGCCACACTGTCACCTGGGGCGAGCAGCCGTGGCTGGGATGACGGTGACACCC 
GCAGCGAGCACAGCTACAGCGAGTCAGGTGCCAGCGGCTCCTCTTTTGAGGAGCTGGACTTGG 
AGGGCGAGGGGCCCTTAGGGGAGTCACGGCTGGACCCTGGGACTGAGCCCCTGGGGACTACCA 
AGTGGCTCTGGGAGCCCACTGCCCCTGAGAAGGGCAAGGAGTAACCCATGGCCTGCACCCTCC 
TGCAGTGCAGTTGCTGAGGAACTGAGCAGACTCTCCAGCAGACTCTCCAGCCCTCTTCCTCCT 
TCCTCTGGGGGAGGAGGGGTTCCTGAGGGACCTGACTTCCCCTGCTCCAGGCCTCTTGCTAAG 
CCTTCTCCTCACTGCCCTTTAGGCTCCCAGGGCCAGAGGAGCCAGGGACTATTTTCTGCACCA
GCCCCCAGGGCTGCCGCCCCTGTTGTGTCTTTTTTTCAGACTCACAGTGGAGCTTCCAGGACC
CAGAATAAAGCCAATGATTTACTTGTTTCACCTGGAAAAAAAAAAAAAAAAA 



WO 01/40466 



PCT/US00/32678 



398/550 

FIGURE 398 

MSDLLLLGLIGGLTLLLLLTLLAFAGYSGLLAGVEVSAGSPPIRNVTVAYKFHMGLYGETGRL 
FTESCSISPKLRSIAVYYDNPHMVPPDKCRCAVGSILSEGEESPSPELIDLYQKFGFKVFSFP 
APSHVVTATFPYTTILSIWLATRRVHPALDTYIKERKLCAYPRLEIYQEDQIHFMCPLARQGD
FYVPEMKETEWKWRGLVEAIDTQVDGTGADTMSDTSSVSLEVSPGSRETSAATLSPGASSRGW 
DDGDTRSEHSYSESGASGSSFEELDLEGEGPLGESRLDPGTEPLGTTKWLWEPTAPEKGKE 
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FIGURE 399 

GGACGAGGGCAGATCTCGTTCTGGGGCAAGCCGTTGACACTCGCTCCCTGCCACCGCCCGGGC 
TCCGTGCCGCCAAGTTTTCATTTTCCACCTTCTCTGCCTCCAGTCCCCCAGCCCCTGGCCGAG 
AGAAGGGTCTTACCGGCCGGGATTGCTGGAAACACCAAGAGGTGGTTTTTGTTTTTTAAAACT 
TCTGTTTCTTGGGAGGGGGTGTGGCGGGGCAGGATGAGCAACTCCGTTCCTCTGCTCTGTTTC 
TGGAGCCTCTGCTATTGCTTTGCTGCGGGGAGCCCCGTACCTTTTGGTCCAGAGGGACGGCTG 
GAAGATAAGCTCCACAAACCCAAAGCTACACAGACTGAGGTCAAACCATCTGTGAGGTTTAAC 
CTCCGCACCTCCAAGGACCCAGAGCATGAAGGATGCTACCTCTCCGTCGGCCACAGCCAGCCC 
TTAGAAGACTGCAGTTTCAACATGACAGCTAAAACCTTTTTCATCATTCACGGATGGACGATG 
AGCGGTATCTTTGAAAACTGGCTGCACAAACTCGTGTCAGCCCTGCACACAAGAGAGAAAGAC 
GCCAATGTAGTTGTGGTTGACTGGCTCCCCCTGGCCCACCAGCTTTACACGGATGCGGTCAAT 
AATACCAGGGTGGTGGGACACAGCATTGCCAGGATGCTCGACTGGCTGCAGGAGAAGGACGAT 
TTTTCTCTCGGGAATGTCCACTTGATCGGCTACAGCCTCGGAGCGCACGTGGCCGGGTATGCA 
GGCAACTTCGTGAAAGGAACGGTGGGCCGAATCACAGGTTTGGATCCTGCCGGGCCCATGTTT 
GAAGGGGCCGACATCCACAAGAGGCTCTCTCCGGACGATGCAGATTTTGTGGATGTCCTCCAC 
ACCTACACGCGTTCCTTCGGCTTGAGCATTGGTATTCAGATGCCTGTGGGCCACATTGACATC 
TACCCCAATGGGGGTGACTTCCAGCCAGGCTGTGGACTCAACGATGTCTTGGGATCAATTGCA 
TATGGAACAATCACAGAGGTGGTAAAATGTGAGCATGAGCGAGCCGTCCACCTCTTTGTTGAC 
TCTCTGGTGAATCAGGACAAGCCGAGTTTTGCCTTCCAGTGCACTGACTCCAATCGCTTCAAA 
AAGGGGATCTGTCTGAGCTGCCGCAAGAACCGTTGTAATAGCATTGGCTACAATGCCAAGAAA 
ATGAGGAACAAGAGGAACAGCAAAATGTACCTAAAAACCCGGGCAGGCATGCCTTTCAGAGGT 
AACCTTCAGTCCCTGGAGTGTCCCTGAGGAAGGCCCTTAATACCTCCTTCTTAATACCATGCT
GCAGAGCAGGGCACATCCTAGCCCAGGAGAAGTGGCCAGCACAATCCAATCAAATCGTTGCAA 
ATCAGATTACACTGTGCATGTCCTAGGAAAGGGAATCTTTACAAAATAAACAGTGTGGACCCC 
TAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 400 

MSNSVPLLCFWSLCYCFAAGSPVPFGPEGRLEDKLHKPKATQTEVKPSVRFNLRTSKDPEHEG 
CYLSVGHSQPLEDCSFNMTAKTFFIIHGWTMSGIFENWLHKLVSALHTREKDANVVVVDWLPL
AHQLYTDAVNNTRVVGHSIARMLDWLQEKDDFSLGNVHLIGYSLGAHVAGYAGNFVKGTVGRI 
TGLDPAGPMFEGADIHKRLSPDDADFVDVLHTYTRSFGLSIGIQMPVGHIDIYPNGGDFQPGC 
GLNDVLGSIAYGTITEVVKCEHERAVHLFVDSLVNQDKPSFAFQCTDSNRFKKGICLSCRKNR
CNSIGYNAKKMRNKRNSKMYLKTRAGMPFRGNLQSLECP 

Important features: 
Signal peptide: 

amino acids 1-16 

Lipases, serine active site.

amino acids 163-172 

N-glycosylation sites.

amino acids 80-83 and 136-139 
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FIGURE 401 

CTTCCCAGCCCTGTGCCCCAAAGCACCTGGAGCATATAGCCTTGCAGAACTTCTACTTGCCTG 
CCTCCCTGCCTCTGGCCATGGCCTGCCGGTGCCTCAGCTTCCTTCTGATGGGGACCTTCCTGT 
CAGTTTCCCAGACAGTCCTGGCCCAGCTGGATGCACTGCTGGTCTTCCCAGGCCAAGTGGCTC 
AACTCTCCTGCACGCTCAGCCCCCAGCACGTCACCATCAGGGACTACGGTGTGTCCTGGTACC 
AGCAGCGGGCAGGCAGTGCCCCTCGATATCTCCTCTACTACCGCTCGGAGGAGGATCACCACC 
GGCCTGCTGACATCCCCGATCGATTCTCGGCAGCCAAGGATGAGGCCCACAATGCCTGTGTCC 
TCACCATTAGTCCCGTGCAGCCTGAAGACGACGCGGATTACTACTGCTCTGTTGGCTACGGCT 
TTAGTCCC TAG GGGTGGGGTGTGAGATGGGTGCCTCCCCTCTGCCTCCCATTTCTGCCCCTGA 
CCTTGGGTCCCTTTTAAACTTTCTCTGAGCCTTGCTTCCCCTCTGTAAAATGGGTTAATAATA 
TTCAACATGTCAACAAC 




FIGURE 402 

MACRCLSFLLMGTFLSVSQTVLAQLDALLVFPGQVAQLSCTLSPQHVTIRDYGVSWYQQRAGS 
APRYLLYYRSEEDHHRPADIPDRFSAAKDEAHNACVLTISPVQPEDDADYYCSVGYGFSP 
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FIGURE 403 

CGCGCCGGGCGCAGGGAGCTGAGTGGACGGCTCGAGACGGCGGCGCGTGCAGCAGCTCCAGAAAGCAGCGAGTTG 
GCAGAGCAGGGCTGCATTTCCAGCAGGAGCTGCGAGCACAGTGCTGGCTCACAACAAGATGCTCAAGGTGTCAGC 
CGTACTGTGTGTGTGTGCAGCCGCTTGGTGCAGTCAGTCTCTCGCAGCTGCCGCGGCGGTGGCTGCAGCCGGGGG 
GCGGTCGGACGGCGGTAATTTTCTGGATGATAAACAATGGCTCACCACAATCTCTCAGTATGACAAGGAAGTCGG 
ACAGTGGAACAAATTCCGAGACGAAGTAGAGGATGATTATTTCCGCACTTGGAGTCCAGGAAAACCCTTCGATCA 
GGCTTTAGATCCAGCTAAGGATCCATGCTTAAAGATGAAATGTAGTCGCCATAAAGTATGCATTGCTCAAGATTC 
TCAGACTGCAGTCTGCATTAGTCACCGGAGGCTTACACACAGGATGAAAGAAGCAGGAGTAGACCATAGGCAGTG 
GAGGGGTCCCATATTATCCACCTGCAAGCAGTGCCCAGTGGTCTATCCCAGCCCTGTTTGTGGTTCAGATGGTCA 
TACCTACTCTTTTCAGTGCAAACTAGAATATCAGGCATGTGTCTTAGGAAAACAGATCTCAGTCAAATGTGAAGG 
ACATTGCCCATGTCCTTCAGATAAGCCCACCAGTACAAGCAGAAATGTTAAGAGAGCATGCAGTGACCTGGAGTT 
CAGGGAAGTGGCAAACAGATTGCGGGACTGGTTCAAGGCCCTTCATGAAAGTGGAAGTCAAAACAAGAAGACAAA 
AACATTGCTGAGGCCTGAGAGAAGCAGATTCGATACCAGCATCTTGCCAATTTGCAAGGACTCACTTGGCTGGAT 
GTTTAACAGACTTGATACAAACTATGACCTGCTATTGGACCAGTCAGAGCTCAGAAGCATTTACCTTGATAAGAA 
TGAACAGTGTACCAAGGCATTCTTCAATTCTTGTGACACATACAAGGACAGTTTAATATCTAATAATGAGTGGTG 
CTACTGCTTCCAGAGACAGCAAGACCCACCTTGCCAGACTGAGCTCAGCAATATTCAGAAGCGGCAAGGGGTAAA 
GAAGCTCCTAGGACAGTATATCCGCCTGTGTGATGAAGATGGTTACTACAAGCCAACACAATGTCATGGCAGTGT 
TGGACAGTGCTGGTGTGTTGACAGATATGGAAATGAAGTCATGGGATCCAGAATAAATGGTGTTGCAGATTGTGC 
TATAGATTTTGAGATCTCCGGAGATTTTGCTAGTGGCGATTTTCATGAATGGACTGATGATGAGGATGATGAAGA 
CGATATTATGAATGATGAAGATGAAATTGAAGATGATGATGAAGATGAAGGGGATGATGATGATGGTGGTGATGA 
CCATGATGTATACATTTGATTGATGACAGTTGAAATCAATAAATTCTACATTTCTAATATTTACAAAAATGATAG 
CCTATTTAAAATTATCTTCTTCCCCAATAACAAAATGATTCTAAACCTCACATATATTTTGTATAATTATTTGAA 
AAATTGCAGCTAAAGTTATAGAACTTTATGTTTAAATAAGAATCATTTGCTTTGAGTTTTTATATTCCTTACACA 
AAAAGAAAATACATATGCAGTCTAGTCAGACAAAATAAAGTTTTGAAGTGCTACTATAATAAATTTTTCACGAGA 
ACAAACTTTGTAAATCTTCCATAAGCAAAATGACAGCTAGTGCTTGGGATCGTACATGTTAATTTTTTGAAAGAT 
AATTCTAAGTGAAATTTAAAATAAATAAATTTTTAATGACCTGGGTCTTAAGGATTTAGGAAAAATATGCATGCT 
TTAATTGCATTTCCAAAGTAGCATCTTGCTAGACCTAGATGAGTCAGGATAACAGAGAGATACCACATGACTCCA
AAAAAAAAAAAAAA 
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FIGURE 404 

MLKVSAVLCVCAAAWCSQSLAAAAAVAAAGGRSDGGNFLDDKQWLTTISQYDKEVGQWNKFRD 
EVEDDYFRTWSPGKPFDQALDPAKDPCLKMKCSRHKVCIAQDSQTAVCISHRRLTHRMKEAGV 
DHRQWRGPILSTCKQCPVVYPSPVCGSDGHTYSPQCKLEYQACVLGKQISVKCEGHCPCPSDK 
PTSTSRNVKRACSDLEFREVANRLRDWFKALHESGSQNKKTKTLLRPERSRFDTSILPICKDS 
LGWMFNRLDTNYDLLLDQSELRSIYLDKNEQCTKAFFNSCDTYKDSLISNNEWCYCFQRQQDP 
PCQTELSNIQKRQGVKKLLGQYIPLCDEDGYYKPTQCHGSVGQCWCVDRYGNEVMGSRINGVA 
DCAIDFEISGDFASGDFHEWTDDEDDEDDIMNDEDEIEDDDEDEGDDDDGGDDHDVYI 

Important features: 
Signal peptide: 

amino acids 1-16 

Leucine zipper pattern.

amino acids 246-267

N-myristoylation sites. 

amino acids 357-362, 371-376 and 376-381 

Thyroglobulin type-1 repeat proteins 

amino acids 353-365 and 339-352 
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FIGURE 405 

GGAAGGGGAGGAGCAGGCCACACAGGCACAGGCCGGTGAGGGACCTGCCCAGACCTGGAGGGTCTCGCTCTGTCA 

CACAGGCTGGAGTGCAGTGGTGTGATCTTGGCTCATCGTAACCTCCACCTCCCGGGTTCAAGTGATTCTCATGCC 

TCAGCCTCCCGAGTAGCTGGGATTACAGGTGGTGACTTCCAAGAGTGACTCCGTCGGAGGAAAATGACTCCCCAG 

TCGCTGCTGCAGACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGCCCACGGCAGGGGCCACAGG 

GAAGACTTTCGCTTCTGCAGCCAGCGGAACCAGACACACAGGAGCAGCCTCCACTACAAACCCACACCAGACCTG 

CGCATCTCCATCGAGAACTCCGAAGAGGCCCTCACAGTCCATGCCCCTTTCCCTGCAGCCCACCCTGCTTCCCGA 

TCCTTCCCTGACCCCAGGGGCCTCTACCACTTCTGCCTCTACTGGAACCGACATGCTGGGAGATTACATCTTCTC 

TATGGCAAGCGTGACTTCTTGCTGAGTGACAAAGCCTCTAGCCTCCTCTGCTTCCAGCACCAGGAGGAGAGCCTG 

GCTCAGGGCCCCCCGCTGTTAGCCACTTCTGTCACCTCCTGGTGGAGCCCTCAGAACATCAGCCTGCCCAGTGCC 

GCCAGCTTCACCTTCTCCTTCCACAGTCCTCCCCACACGGCCGCTCACAATGCCTCGGTGGACATGTGCGAGCTC 

AAAAGGGACCTCCAGCTGCTCAGCCAGTTCCTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCCTCGGCTGCCCCC 

GCCAGCCAGCAGTTGCAGAGCCTGGAGTCGAAACTGACCTCTGTGAGATTCATGGGGGACATGGTGTCCTTCGAG 

GAGGACCGGATCAACGCCACGGTGTGGAAGCTCCAGCCCACAGCCGGCCTCCAGGACCTGCACATCCACTCCCGG 

CAGGAGGAGGAGCAGAGCGAGATCATGGAGTACTCGGTGCTGCTGCCTCGAACACTCTTCCAGAGGACGAAAGGC 

CGGAGCGGGGAGGCTGAGAAGAGACTCCTCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAGGACAAGAATTCC 

AGCCAAGTCCTGGGTGAGAAGGTCTTGGGGATTGTGGTACAGAACACCAAAGTAGCCAACCTCACGGAGCCCGTG 

GTGCTCACTTTCCAGCACCAGCTACAGCCGAAGAATGTGACTCTGCAATGTGTGTTCTGGGTTGAAGACCCCACA 

TTGAGCAGCCCGGGGCATTGGAGCAGTGCTGGGTGTGAGACCGTCAGGAGAGAAACCCAAACATCCTGCTTCTGC 

AACCACTTGACCTACTTTGCAGTGCTGATGGTCTCCTCGGTGGAGGTGGACGCCGTGCACAAGCACTACCTGAGC 

CTCCTCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCCTACCTCTGCTCCAGG 

GTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTACACCATCAAGGTGCACATGAACCTGCTGCTGGCCGTCTTC 

CTGCTGGACACGAGCTTCCTGCTCAGCGAGCCGGTGGCCCTGACAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCC 

ATCTTCCTGCACTTCTCCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTCTACCGACTCGTG 

GTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGGGCTGGGGCTTCCCCATCTTT 

CTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAG 

GGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGC 

CTGGTGTTTCTGTTCAACATGGCCATGCTAGCCACCATGGTGGTGCAGATCCTGCGGCTGCGCCCCCACACCCAA 

AAGTGGTCACATGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCTGGGCCTTGATCTTCTTCTCC 

TTTGCTTCTGGCACCTTCCAGCTTGTCGTCCTCTACCTTTTCAGCATCATCACCTCCTTCCAAGGCTTCCTCATC 

TTCATCTGGTACTGGTCCATGCGGCTGCAGGCCCGGGGTGGCCCCTCCCCTCTGAAGAGCAACTCAGACAGCGCC 

AGGCTCCCCATCAGCTCGGGCAGCACCTCGTCCAGCCGCATCTAGGCCTCCAGCCCACCTGCCCATGTGATGAAG 

CAGAGATGCGGCCTCGTCGCACACTGCCTGTGGCCCCCGAGCCAGGCCCAGCCCCAGGCCAGTCAGCCGCAGACT 

TTGGAAAGCCCAACGACCATGGAGAGATGGGCCGTTGCCATGGTGGACGGACTCCCGGGCTGGGCTTTTGAATTG

GCCTTGGGGACTACTCGGCTCTCACTCAGCTCCCACGGGACTCAGAAGTGCGCCGCCATGCTGCCTAGGGTACTG 

TCCCCACATCTGTCCCAACCCAGCTGGAGGCCTGGTCTCTCCTTACAACCCCTGGGCCCAGCCCTCATTGCTGGG 

GGCCAGGCCTTGGATCTTGAGGGTCTGGCACATCCTTAATCCTGTGCCCCTGCCTGGGACAGAAATGTGGCTCCA 

GTTGCTCTGTCTCTCGTGGTCACCCTGAGGGCACTCTGCATCCTCTGTCATTTTAACCTCAGGTGGCACCCAGGG 

CGAATGGGGCCCAGGGCAGACCTTCAGGGCCAGAGCCCTGGCGGAGGAGAGGCCCTTTGCCAGGAGCACAGCAGC 

AGCTCGCCTACCTCTGAGCCCAGGCCCCCTCCCTCCCTCAGCCCCCCAGTCCTCCCTCCATCTTCCCTGGGGTTC 

TCCTCCTCTCCCAGGGCCTCCTTGCTCCTTCGTTCACAGCTGGGGGTCCCCGATTCGAATGCTGTTTTTTGGGGA 

GTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTAAATGTTTGTCTACTGCACAAGCCTCGGCCTGCCCCTGAGCCA 

GGCTCGGTACCGATGCGTGGGCTGGGCTAGGTCCCTCTGTCCATCTGGGCCTTTGTATGAGCTGCATTGCCCTTG 

CTCACCCTGACCAAGCACACGCCTCAGAGGGGCCCTCAGCCTCTCCTGAAGCCCTCTTGTGGCAAGAACTGTGGA 

CCATGCCAGTCCCGTCTGGTTTCCATCCCACCACTCCAAGGACTGAGACTGACCTCCTCTGGTGACACTGGCCTA 

GAGCCTGACACTCTCCTAAGAGGTTCTCTCCAAGCCCCCAAATAGCTCCAGGCGCCCTCGGCCGCCCATCATGGT

TAATTCTGTCCAACAAACACACACGGGTAGATTGCTGGCCTGTTGTAGGTGGTAGGGACACAGATGACCGACCTG 

GTCACTCCTCCTGCCAACATTCAGTCTGGTATGTGAGGCGTGCGTGAAGCAAGAACTCCTGGAGCTACAGGGACA 

GGGAGCCATCATTCCTGCCTGGGAATCCTGGAAGACTTCCTGCAGGAGTCAGCGTTCAATCTTGACCTTGAAGAT 

GGGAAGGATGTTCTTTTTACGTACCAATTCTTTTGTCTTTTGATATTAAAAAGAAGTACATGTTCATTGTAGAGA 

ATTTGGAAACTGTAGAAGAGAATCAAGAAGAAAAATAAAAATCAGCTGTTGTAATCGCCTAGCAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 406

MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIENSEE 
ALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLLCFQHQE
ESLAQGPPLLATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELKRDLQLLSQF 
LKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKLQPTAGLQDLHIH 
SRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQDKNSSQVLGEKVLGI 
VVQNTKVANLTEPVVLTFQHQLQPKNVTLQCVFWVEDPTLSSPGHWSSAGCETVRRETQTSCF
CNHLTYFAVLMVSSVEVDAVHKHYLSLLSYVGCWSALACLVTIAAYLCSRVPLPCRRKPRDY 
TIKVHMNLLLAVFLLDTSFLLSEPVALTGSEAGCRASAIFLHFSLLTCLSWMGLEGYNLYRLV 
VEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVDNYGPIILAVHRTPEGVIYPSMCWIRDS 
LVSYITNLGLFSLVFLFNMAMLATMVVQILRLRPHTQKWSHVLTLLGLSLVLGLPWALIFFSF 
ASGTFQLVVLYLFSIITSFQGFLIFIWYWSMRLQARGGPSPLKSNSDSARLPISSGSTSSSRI

Important features: 

Signal peptide: 

amino acids 1-25 

Putative transmembrane domains: 

amino acids 382-398, 402-420, 445-468, 473-491, 519-537, 568-590 
and 634-657 

Microbodies C-terminal targeting signal.

amino acids 691-693 

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 198-201 and 370-373 
N-glycosylation sites. 

amino acids 39-42, 148-151, 171-174, 234-237, 303-306, 324-327 
and 341-344 

G-protein coupled receptors family 2 proteins 

amino acids 475-504 
FIGURE 407

TTGTGACTAAAAGCTGGCCTAGCAGGCCAGGGAGTGCAGCTGCAGGCGTGGGGGTGGCAGGAG 
CCGCAGAGCCAGAGCAGACAGCCGAGAAACAGGTGGACAGTGTGAAAGAACCAGTGGTCTCGC 
TCTGTTGCCCAGGCTAGAGTGTACTGGCGTGATCATAGCTCACTGCAGCCTCAGACTCCTGGA 
CTTGAGAAATCCTCCTGCCTTAGCCTCCTGCATATCTGGGACTCCAGGGGTGCACTCAAGCCC 
TGTTTCTTCTCCTTCTGTGAGTGGACCACGGAGGCTGGTGAGCTGCCTGTCATCCCAAAGCTC 
AGCTCTGAGCCAGAGTGGTGGTGGCTCCACCTCTGCCGCCGGCATAGAAGCCAGGAGCAGGGC 
TCTCAGAAGGCGGTGGTGCCCAGCTGGGATCATGTTGTTGGCCCTGGTCTGTCTGCTCAGCTG 
CCTGCTACCCTCCAGTGAGGCCAAGCTCTACGGTCGTTGTGAACTGGCCAGAGTGCTACATGA 
CTTCGGGCTGGACGGATACCGGGGATACAGCCTGGCTGACTGGGTCTGCCTTGCTTATTTCAC 
AAGCGGTTTCAACGCAGCTGCTTTGGACTACGAGGCTGATGGGAGCACCAACAACGGGATCTT 
CCAGATCAACAGCCGGAGGTGGTGCAGCAACCTCACCCCGAACGTCCCCAACGTGTGCCGGAT 
GTACTGCTCAGATTTGTTGAATCCTAATCTCAAGGATACCGTTATCTGTGCCATGAAGATAAC 
CCAAGAGCCTCAGGGTCTGGGTTACTGGGAGGCCTGGAGGCATCACTGCCAGGGAAAAGACCT 
CACTGAATGGGTGGATGGCTGTGACTTCTAGGATGGACGGAACCATGCACAGCAGGCTGGGAA
ATGTGGTTTGGTTCCTGACCTAGGCTTGGGAAGACAAGCCAGCGAATAAAGGATGGTTGAACG 
TGAAA 
FIGURE 408

MLLALVCLLSCLLPSSEAKLYGRCELARVLHDFGLDGYRGYSLADWVCLAYFTSGFNAAALDY 
EADGSTNNGIFQINSRRWCSNLTPNVPNVCRMYCSDLLNPNLKDTVICAMKITQEPQGLGYWE 
AWRHHCQGKDLTEWVDGCDF 

Important features: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 67-72

Homologous region to Alpha-lactalbumin / lysozyme C proteins.

amino acids 34-58 (catalytic domain), 111-132 and 66-107 
FIGURE 409

CAGACTCCAGATTTCCCTGTCAACCACGAGGAGTCCAGAGAGGAAACGCGGAGCGGAGACAACAGTACCTGACGC 

CTCTTTCAGCCCGGGATCGCCCCAGCAGGGATGGGCGACAAGATCTGGCTGCCCTTCCCCGTGCTCCTTCTGGCC 

GCTCTGCCTCCGGTGCTGCTGCCTGGGGCGGCCGGCTTCACACCTTCCCTCGATAGCGACTTCACCTTTACCCTT 

CCCGCCGGCCAGAAGGAGTGCTTCTACCAGCCCATGCCCCTGAAGGCCTCGCTGGAGATCGAGTACCAAGTTTTA 

GATGGAGCAGGATTAGATATTGATTTCCATCTTGCCTCTCCAGAAGGCAAAACCTTAGTTTTTGAACAAAGAAAA 

TCAGATGGAGTTCACACTGTAGAGACTGAAGTTGGTGATTACATGTTCTGCTTTGACAATACATTCAGCACCATT 

TCTGAGAAGGTGATTTTCTTTGAATTAATCCTGGATAATATGGGAGAACAGGCACAAGAACAAGAAGATTGGAAG 

AAATATATTACTGGCACAGATATATTGGATATGAAACTGGAAGACATCCTGGAATCCATCAACAGCATCAAGTCC 

AGACTAAGCAAAAGTGGGCACATACAAATTCTGCTTAGAGCATTTGAAGCTCGTGATCGAAACATACAAGAAAGC 

AACTTTGATAGAGTCAATTTCTGGTCTATGGTTAATTTAGTGGTCATGGTGGTGGTGTCAGCCATTCAAGTTTAT 

ATGCTGAAGAGTCTGTTTGAAGATAAGAGGAAAAGTAGAACTTAAAACTCCAAACTAGAGTACGTAACATTGAAA 

AATGAGGCATAAAAATGCAATAAACTGTTACAGTCAAGACCATTAATGGTCTTCTCCAAAATATTTTGAGATATA 

AAAGTAGGAAACAGGTATAATTTTAATGTGAAAATTAAGTCTTCACTTTCTGTGCAAGTAATCCTGCTGATCCAG 

TTGTACTTAAGTGTGTAACAGGAATATTTTGCAGAATATAGGTTTAACTGAATGAAGCCATATTAATAACTGCAT 

TTTCCTAACTTTGAAAAATTTTGCAAATGTCTTAGGTGATTTAAATAAATGAGTATTGGGCCTAATTGCAACACC 

AGTCTGTTTTTAACAGGTTCTATTACCCAGAACTTTTTTGTAAATGCGGCAGTTACAAATTAACTGTGGAAGTTT 

TCAGTTTTAAGTTATAAATCACCTGAGAATTACCTAATGATGGATTGAATAAATCTTTAGACTACAAAAGCCCAA 

CTTTTCTCTATTTACATATGCATCTCTCCTATAATGTAAATAGAATAATAGCTTTGAAATACAATTAGGTTTTTG 

AGATTTTTATAACCAAATACATTTCAGTGTAACATATTAGCAGAAAGCATTAGTCTTTGTACTTTGCTTACATTC 

CCAAAAGCTGACATTTTCACGATTCTTAAAAACACAAAGTTACACTTACTAAAATTAGGACATGTTTTCTCTTTG 

AAATGAAGAATATAGTTTAAAAGCTTCCTCCTCCATAGGGACACATTTTCTCTAACCCTTAACTAAAGTGTAGGA 

TTTTAAAATTAAATGTGAGGTAAAATAAGTTTATTTTTAATAGTATCTGTCAAGTTAATATCTGTCAACAGTTAA 

TAATCATGTTATGTTAATTTTAACATGATTGCTGACTTGGATAATTCATTATTACCAGCAGTTATGAAGGAAATA 

TTGCTAAAATGATCTGGGCCTACCATAAATAAATATCTCCTTTTCTGAGCTCTAAGAATTATCAGAAAACAGGAA 

AGAATTTAGAAAAACTTGAGAAAACCTAATCCAAAATAAAATTCACTTAAGTAGAACTATAAATAAATATCTAGA 

ATCTGACTGGCTCATCATGACATCCTACTCATAACATAAATCAAAGGAGATGATTAATTTCCAGTTAGCTGGAAG 

AAACTTTGGCTGTAGGTTTTTATTTTCTACAAGAATTCTGGTTTGAATTATTTTTGTAAGCAGGTACATTTTATA 

AAATGTAAGCCCTACTGTAAGGTTTAGCACTGGGTGTACATATTTATTAAAAATTTTTATTATAACAACTTTTAT 

TAAAATGGCCTTTCTGAACACTTTATTTATTGATGTTGAAGTAAGGATTAGAAACATAGACTCCCAAGTTTTAAA 

CACCTAAATGTGAATAACCCATATATACAACAAAGTTTCTGCCATCTAGCTTTTTGAAGTCTATGGGGGTCTTAC 

TCAAGTACTAGTAATTTAACTTCATCATGAATGAACTATAATTTTTAAGTTATGCCCATTTATAACGTTGTTTAT 

GACTACATTGTGAGTTAGAAACAAACTTAAAATTTGGGGTATAGAACCCCTCAACAGGTTAGTAATGCTGGAATT 

CTTGATGAGCAATAATGATAACCAGAGAGTGATTTCATTTACACTCATAGTAGTATAAAAAGAGATACATTTCCC 

TCTTAGGCCCCTGGGAGAAGAGCAGCTTAGATTTCCCTACTGGCAAGGTTTTTAAAAATGAGGTAAATGCCGTAT 

ATGATCAATTACCTTAATTGGCCAAGAAAATGCTTCAGGTGTCTAGGGGTATCCTCTGCAACACTTGCAGAACAA

AGGTCAATAAGATCCTTGCCTATGAATACCCCTCCCTTTTGCGCTGTTAAATTTGCAATGAGAAGCAAATTTACA 

GTACCATAACTAATAAAGCAGGGTACAGATATAAACTACTGCATCTTTTCTATAAAACTGTGATTAAGAATTCTA 

CCTCTCCTGTATGGCTGTTACTGTACTGTACTCTCTGACTCCTTACCTAACAATGAATTTGTTACATAATCTTCT 

ACATGTATGATTTGTGCCACTGATCTTAAACCTATGATTCAGTAACTTCTTACCATATAAAAACGATAATTGCTT 

TATTTGGAAAAGAATTTAGGAATACTAAGGACAATTATTTTTATAGACAAAGTAAAAAGACAGATATTTAAGAGG 

CATAACCAAAAAAGCAAAACTTGTAAACAGAGTAAAAATCTTTAATATTTCTAAAGACATACTGTTTATCTGCTT 

CATATGCTTTTTTTAATTTCACTATTCCATTTCTAAATTAAAGTTATGCTAAATTGAGTAAGCTGTTTATCACTT 

AACAGCTCATTTTGTCTTTTTCAATATACAAATTTTAAAAATACTACAATATTTAACTAAGGCCCAACCGATTTC 

CATAATGTAGCAGTTACCGTGTTCACCTCACACTAAGGCCTAGAGTTTGCTCTGATATGCATTTGGATGATTAAT 

GTTATGCTGTTCTTTCATGTGAATGTCAAGACATGGAGGGTGTTTGTAATTTTATGGTAAAATTAATCCTTCTTA 

CACATAATGGTGTCTTAAAATTGACAAAAAATGAGCACTTACAATTGTATGTCTCCTCAAATGAAGATTCTTTAT 

GTGAAATTTTAAAAGACATTGATTCCGCATGTAAGGATTTTTCATCTGAAGTACAATAATGCACAATCAGTGTTG 

CTCAAACTGCTTTATACTTATAAACAGCCATCTTAAATAAGCAACGTATTGTGAGTACTGATATGTATATAATAA 

AAATTATCAAAGGAAAA 
FIGURE 410

MGDKIWLPFPVLLLAALPPVLLPGAAGFTPSLDSDFTFTLPAGQKECFYQPMPLKASLEIEYQ 
VLDGAGLDIDFHLASPEGKTLVFEQRKSDGVHTVETEVGDYMFCFDNTFSTISEKVIFFELIL
DNMGEQAQEQEDWKKYITGTDILDMKLEDILESINSIKSRLSKSGHIQILLRAFEARDRNIQE 
SNFDRVNFWSMVNLVVMVVVSAIQVYMLKSLFEDKRKSRT

Important features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 

amino acids 195-217 

N-myristoylation site. 

amino acids 43-48 

Tyrosine kinase phosphorylation site. 

amino acids 55-62 
FIGURE 411



CCCAGCTGAGGAGCCCTGCTCAAGACACGGTCACTGGATCTGAGAAACTTCCCAGGGGACCGCATTCCAGAGTCA 
GTGACTCTGTGAAGCACCCACATCTACCTCTTGCCACGTTCCCACGGGCTTGGGGGAAAGATGGTGGGGACCAAG 
GCCTGGGTGTTCTCCTTCCTGGTCCTGGAAGTCACATCTGTGTTGGGGAGACAGACGATGCTCACCCAGTCAGTA 
AGAAGAGTCCAGCCTGGGAAGAAGAACCCCAGCATCTTTGCCAAGCCTGCCGACACCCTGGAGAGCCCTGGTGAG 
TGGACAACATGGTTCAACATCGACTACCCAGGCGGGAAGGGCGACTATGAGCGGCTGGAGGCCATTCGCTTCTAC 
TATGGGGACCGTGTATGTGCCCGTCCCCTGCGGCTAGAGGCTCGGACCACTGACTGGACACCTGCGGGCAGCACT 
GGCCAGGTGGTCCATGGTAGTCCCCGTGAGGGTTTCTGGTGCCTCAACAGGGAGCAGCGGCCTGGCCAGAACTGC 
TCTAATTACACCGTACGCTTCCTCTGCCCACCAGGATCCCTGCGCCGAGACACAGAGCGCATCTGGAGCCCATGG 
TCTCCCTGGAGCAAGTGCTCAGCTGCCTGTGGTCAGACTGGGGTCCAGACTCGCACACGCATTTGCTTGGCAGAG 
ATGGTGTCGCTGTGCAGTGAGGCCAGCGAAGAGGGTCAGCACTGCATGGGCCAGGACTGTACAGCCTGTGACCTG 
ACCTGCCCAATGGGCCAGGTGAATGCTGACTGTGATGCCTGCATGTGCCAGGACTTCATGCTTCATGGGGCTGTC 
TCCCTTCCCGGAGGTGCCCCAGCCTCAGGGGCTGCTATCTACCTCCTGACCAAGACGCCGAAGCTGCTGACCCAG 
ACAGACAGTGATGGGAGATTCCGAATCCCTGGCTTGTGCCCTGATGGCAAAAGCATCCTGAAGATCACAAAGGTC 
AAGTTTGCCCCCATTGTACTCACAATGCCCAAGACTAGCCTGAAGGCAGCCACCATCAAGGCAGAGTTTGTGAGG 
GCAGAGACTCCATACATGGTGATGAACCCTGAGACAAAAGCACGGAGAGCTGGGCAGAGCGTGTCTCTGTGCTGT 
AAGGCCACAGGGAAGCCCAGGCCAGACAAGTATTTTTGGTATCATAATGACACATTGCTGGATCCTTCCCTCTAC 
AAGCATGAGAGCAAGCTGGTGCTGAGGAAACTGCAGCAGCACCAGGCTGGGGAGTACTTTTGCAAGGCCCAGAGT 
GATGCTGGGGCTGTGAAGTCCAAGGTTGCCCAGCTGATTGTCACAGCATCTGATGAGACTCCTTGCAACCCAGTT 
CCTGAGAGCTATCTTATCCGGCTGCCCCATGATTGCTTTCAGAATGCCACCAACTCCTTCTACTATGACGTGGGA 
CGCTGCCCTGTTAAGACTTGTGCAGGGGAGCAGGATAATGGGATCAGGTGCCGTGATGCTGTGCAGAACTGCTGT 
GGCATCTCCAAGACAGAGGAAAGGGAGATCCAGTGCAGTGGCTACACGCTACCCACCAAGGTGGCCAAGGAGTGC 
AGCTGCCAGCGGTGTACGGAAACTCGGAGCATCGTGCGGGGCCGTGTCAGTGCTGCTGACAATGGGGAGCCCATG 
CGCTTTGGCCATGTGTACATGGGGAACAGCCGTGTAAGCATGACTGGCTACAAGGGCACTTTCACCCTCCATGTC 
CCCCAGGACACTGAGAGGCTGGTGCTCACATTTGTGGACAGGCTGCAGAAGTTTGTCAACACCACCAAAGTGCTA 
CCTTTCAACAAGAAGGGGAGTGCCGTGTTCCATGAAATCAAGATGCTTCGTCGGAAAGAGCCCATCACTTTGGAA 
GCCATGGAGACCAACATCATCCCCCTGGGGGAAGTGGTTGGTGAAGACCCCATGGCTGAACTGGAGATTCCATCC
AGGAGTTTCTACAGGCAGAATGGGGAGCCCTACATAGGAAAAGTGAAGGCCAGTGTGACCTTCCTGGATCCCCGG 
AATATTTCCACAGCCACAGCTGCCCAGACTGACCTGAACTTCATCAATGACGAAGGAGACACTTTCCCCCTTCGG 
ACGTATGGCATGTTCTCTGTGGACTTCAGAGATGAGGTCACCTCAGAGCCACTTAATGCTGGCAAAGTGAAGGTC 
CACCTTGACTCGACCCAGGTCAAGATGCCAGAGCACATATCCACAGTGAAACTCTGGTCACTCAATCCAGACACA 
GGGCTGTGGGAGGAGGAAGGTGATTTCAAATTTGAAAATCAAAGGAGGAACAAAAGAGAAGACAGAACCTTCCTG 
GTGGGCAACCTGGAGATTCGTGAGAGGAGGCTCTTTAACCTGGATGTTCCTGAAAGCAGGCGGTGCTTTGTTAAG 
GTGAGGGCCTACCGGAGTGAGAGGTTCTTGCCTAGTGAGCAGATCCAGGGGGTTGTGATCTCCGTGATTAACCTG 
GAGCCTAGAACTGGCTTCTTGTCCAACCCTAGGGCCTGGGGCCGCTTTGACAGTGTCATCACAGGCCCCAACGGG 
GCCTGTGTGCCTGCCTTCTGTGATGACCAGTCCCCTGATGCCTACTCTGCCTATGTCTTGGCAAGCCTGGCTGGG 
GAGGAACTGCAAGCAGTGGAGTCTTCTCCTAAATTCAACCCAAATGCAATTGGCGTCCCTCAGCCCTATCTCAAC 
AAGCTCAACTACCGTCGGACGGACCATGAGGATCCACGGGTTAAAAAGACAGCTTTCCAGATTAGCATGGCCAAG 
CCAAGGCCCAACTCAGCTGAGGAGAGCAATGGGCCCATCTATGCCTTTGAGAACCTCCGGGCATGTGAAGAGGCA 
CCACCCAGTGCAGCCCACTTCCGGTTCTACCAGATTGAGGGGGATCGATATGACTACAACACAGTCCCCTTCAAC 
GAAGATGACCCTATGAGCTGGACTGAAGACTATCTGGCATGGTGGCCAAAGCCGATGGAATTCAGGGCCTGCTAT 
ATCAAGGTGAAGATTGTGGGGCCACTGGAAGTGAATGTGCGATCCCGCAACATGGGGGGCACTCATCGGCGGACA 
GTGGGGAAGCTGTATGGAATCCGAGATGTGAGGAGCACTCGGGACAGGGACCAGCCCAATGTCTCAGCTGCCTGT 
CTGGAGTTCAAGTGCAGTGGGATGCTCTATGATCAGGACCGTGTGGACCGCACCCTGGTGAAGGTCATCCCCCAG 
GGCAGCTGCCGTCGAGCCAGTGTGAACCCCATGCTGCATGAGTACCTGGTCAACCACTTGCCACTTGCAGTCAAC 
AACGACACCAGTGAGTACACCATGCTGGCACCCTTGGACCCACTGGGCCACAACTATGGCATCTACACTGTCACT 
GACCAGGACCCTCGCACGGCCAAGGAGATCGCGCTCGGCCGGTGCTTTGATGGCACATCCGATGGCTCCTCCAGA 
ATCATGAAGAGCAATGTGGGAGTAGCCCTCACCTTCAACTGTGTAGAGAGGCAAGTAGGCCGCCAGAGTGCCTTC 
CAGTACCTCCAAAGCACCCCAGCCCAGTCCCCTGCTGCAGGCACTGTCCAAGGAAGAGTGCCCTCGAGGAGGCAG 
CAGCGAGCGAGCAGGGGTGGCCAGCGCCAGGGTGGAGTGGTGGCCTCTCTGAGATTTCCTAGAGTTGCTCAACAG 
CCCCTGATCAACTAAGTTTTGTGGTACTTCACCCTCTTCTGCCCTCATTTCATGTGACAGCCATTGTGAGACTGA 
TGCACAAACTGTCACTTGGTTAATTTAAGCACTTCTGTTTTCGTGAATTTGCTTGTTTGTTTCTTCATGCCTTTA 
CTTACTTTGTCCCATGCTACTGATTGGCACGTGGCCCCCACAATGGCACAATAAAGCCCCTTTGTGAAACTGTTC 
TTTAAATGT^AACACAAGAAATTGGCCACTGGTAAAACTCTGCAGCTTCAACTGTACTTCATTTAATGCCATTAAT 
GCAAATATACTTCCTCTTCTTTTTGCATGGTTTTGCCCACCTCTGCAATAGTGATAATCTGATGCTGAAGATCAA 
ATAACCAATATAAAGCATATTTCTTGGCCTTGCTCCACAGGACATAGGCAAGCCTTGATCATAGTTCATACATAT 
AAATGGTGGTGAAATAAAGAAATAAAACACAATACTTTTACTTGAAATGTAAATAACTTATTTATTTCTTTGCTA 
AATTTGGAATTCTAGTGCACATTCAAAGTTAAGCTATTAAATATAGGGTGATCATAGTTCCTCTACCAAGTCTGG 
AAAGAACATCTCCTGGTATCCACAATTACACCAGGTTGCTAACTGTATTTGTACATTTCCCTTTGCATTCGCTTT 
TGTTCTTGCTAGAAACCCAGTGTAGCCCAGGGCAGATGTCAATAAATGCATACTCTGTATTTCGAAAAAA 
FIGURE 412

MVGTKAWVFSFLVLEVTSVLGRQTMLTQSVRRVQPGKKNPSIFAKPADTLESPGEWTTWFNID 
YPGGKGDYERLDAIRFYYGDRVCARPLRLEARTTDWTPAGSTGQVVHGSPREGFWCLNREQRP 
GQNCSNYTVRFLCPPGSLRRDTERIWSPWSPWSKCSAACGQTGVQTRTRICLAEMVSLCSEAS 
EEGQHCMGQDCTACDLTCPMGQVNADCDACMCQDFMLHGAVSLPGGAPASGAAIYLLTKTPKL 
LTQTDSDGRFRIPGLCPDGKSILKITKVKFAPIVLTMPKTSLKAATIKAEFVRAETPYMVMNP 
ETKARRAGQSVSLCCKATGKPRPDKYFWYHNDTLLDPSLYKHESKLVLRKLQQHQAGEYFCKA
QSDAGAVKSKVAQLIVTASDETPCNPVPESYLIRLPHDCFQNATNSFYYDVGRCPVKTCAGQQ
DNGIRCRDAVQNCCGISKTEEREIQCSGYTLPTKVAKECSCQRCTETRSIVRGRVSAADNGEP 
MRFGHVYMGNSRVSMTGYKGTFTLHVPQDTERLVLTFVDRLQKFVNTTKVLPFNKKGSAVFHE
IKMLRRKEPITLEAMETNIIPLGEVVGEDPMAELEIPSRSFYRQNGEPYIGKVKASVTFLDPR
NISTATAAQTDLNFINDEGDTFPLRTYGMFSVDFRDEVTSEPLNAGKVKVHLDSTQVKMPEHI 
STVKLWSLNPDTGLWEEEGDFKFENQRRNKREDRTFLVGNLEIRERRLFNLDVPESRRCFVKV 
RAYRSERFLPSEQIQGVVISVINLEPRTGFLSNPRAWGRFDSVITGPNGACVPAFCDDQSPDA
YSAYVLASLAGEELQAVESSPKFNPNAIGVPQPYLNKLNYRRTDHEDPRVKKTAFQISMAKPR 
PNSAEESNGPIYAFENLRACEEAPPSAAHFRFYQIEGDRYDYNTVPFNEDDPMSWTEDYLAWW 
PKPMEFRACYIKVKIVGPLEVNVRSRNMGGTHRRTVGKLYGIRDVRSTRDRDQPNVSAACLEF 
KCSGMLYDQDRVDRTLVKVIPQGSCRRASVNPMLHEYLVNHLPLAVNNDTSEYTMLAPLDPLG 
HNYGIYTVTDQDPRTAKEIALGRCFDGTSDGSSRIMKSNVGVALTFNCVERQVGRQSAFQYLQ 
STPAQSPAAGTVQGRVPSRRQQRASRGGQRQGGVVASLRFPRVAQQPLIN
FIGURE 413

GCCACGTTGTCTTCTTTCCTTCACCACCACCCAGGAGCTCAGAGATCTAAGCTGCTTTCCATC 
TTTTCTCCCAGCCCCAGGACACTGACTCTGTACAGGATGGGGCCGTCCTCTTGCCTCCTTCTC 
ATCCTAATCCCCCTTCTCCAGCTGATCAACCCGGGGAGTACTCAGTGTTCCTTAGACTCCGTT 
ATGGATAAGAAGATGAAGGATGTTCTCAACAGTCTAGAGTACAGTCCCTCTCCTATAAGCAAG
AAGCTCTCGTGTGCTAGTGTCAAAAGCCAAGGCAGACCGTCCTCCTGCCCTGCTGGGATGGCT 
GTCACTGGCTGTGCTTGTGGCTATGGCTGTGGTTCGTGGGATGTTCAGCTGGAAACCAGCTGC 
CACTGCCAGTGCAGTGTGGTGGACTGGACCACTGCCCGCTGCTGCCACCTGACCTGACAGGGA 
GGAGGCTGAGAACTCAGTTTTGTGACCATGACAGTAATGAAACCAGGGTCCCAACCAAGAAAT 
CTAACTCAAACGTCCCACTTCATTTGTTCCATTCCTGATTCTTGGGTAATAAAGACAAACTTT 
GTACCTCAAAAAAAAAAAAAAAAAAAAA 
FIGURE 414

MGPSSCLLLILIPLLQLINPGSTQCSLDSVMDKKIKDVLNSLEYSPSPISKKLSCASVKS 
QGRPSSCPAGMAVTGCACGYGCGSWDVQLETTCHCQCSVVDWTTARCCHLT
FIGURE 415

CAGAAGAGGGGGCTAGCTAGCTGTCTCTGCGGACCAGGGAGACCCCCGCGCCCCCCCGGTGTG 
AGGCGGCCTCACAGGGCCGGGTGGGCTGGCGAGCCGACGCGGCGGCGGAGGAGGCTGTGAGGA 
GTGTGTGGAACAGGACCCGGGACAGAGGAACCATGGCTCCGCAGAACCTGAGCACCTTTTGCC
TGTTGCTGCTATACCTCATCGGGGCGGTGATTGCCGGACGAGATTTCTATAAGATCTTGGGGG 
TGCCTCGAAGTGCCTCTATAAAGGATATTAAAAAGGCCTATAGGAAACTAGCCCTGCAGCTTC 
ATCCCGACCGGAACCCTGATGATCCACAAGCCCAGGAGAAATTCCAGGATCTGGGTGCTGCTT 
ATGAGGTTCTGTCAGATAGTGAGAAACGGAAACAGTACGATACTTATGGTGAAGAAGGATTAA 
AAGATGGTCATCAGAGCTCCCATGGAGACATTTTTTCACACTTCTTTGGGGATTTTGGTTTCA 
TGTTTGGAGGAACCCCTCGTCAGCAAGACAGAAATATTCCAAGAGGAAGTGATATTATTGTAG 
ATCTAGAAGTCACTTTGGAAGAAGTATATGCAGGAAATTTTGTGGAAGTAGTTAGAAACAAAC 
CTGTGGCAAGGCAGGCTCCTGGCAAACGGAAGTGCAATTGTCGGCAAGAGATGCGGACCACCC 
AGCTGGGCCCTGGGCGCTTCCAAATGACCCAGGAGGTGGTCTGCGACGAATGCCCTAATGTCA 
AACTAGTGAATGAAGAACGAACGCTGGAAGTAGAAATAGAGCCTGGGGTGAGAGACGGCATGG 
AGTACCCCTTTATTGGAGAAGGTGAGCCTCACGTGGATGGGGAGCCTGGAGATTTACGGTTCC 
GAATCAAAGTTGTCAAGCACCCAATATTTGAAAGGAGAGGAGATGATTTGTACACAAATGTGA 
CAATCTCATTAGTTGAGTCACTGGTTGGCTTTGAGATGGATATTACTCACTTGGATGGTCACA 
AGGTACATATTTCCCGGGATAAGATCACCAGGCCAGGAGCGAAGCTATGGAAGAAAGGGGAAG 
GGCTCCCCAACTTTGACAACAACAATATCAAGGGCTCTTTGATAATCACTTTTGATGTGGATT
TTCCAAAAGAACAGTTAACAGAGGAAGCGAGAGAAGGTATCAAACAGCTACTGAAACAAGGGT 
CAGTGCAGAAGGTATACAATGGACTGCAAGGATATTGAGAGTGAATAAAATTGGACTTTGTTT 
AAAATAAGTGAATAAGCGATATTTATTATCTGCAAGGTTTTTTTGTGTGTGTTTTTGTTTTTA 
TTTTCAATATGCAAGTTAGGCTTAATTTTTTTATCTAATGATCATCATGAAATGAATAAGAGG 
GCTTAAGAATTTGTCCATTTGCATTCGGAAAAGAATGACCAGCAAAAGGTTTACTAATACCTC 
TCCCTTTGGGGATTTAATGTCTGGTGCTGCCGCCTGAGTTTCAAGAATTAAAGCTGCAAGAGG 
ACTCCAGGAGCAAAAGAAACACAATATAGAGGGTTGGAGTTGTTAGCAATTTCATTCAAAATG
CCAACTGGAGAAGTCTGTTTTTAAATACATTTTGTTGTTATTTTTA 
FIGURE 416

MAPQNLSTFCLLLLYLIGAVIAGRDFYKILGVPRSASIKDIKKAYRKLALQLHPDRNPDDPQAQEKFQDLGAAYE 
VLSDSEKRKQYDTYGEEGLKDGHQSSHGDIFSHFFGDFGFMFGGTPRQQDRNIPRGSDIIVDLEVTLEEVYAGNF 
VEVVRNKPVARQAPGKRKCNCRQEMRTTQLGPGRFQMTQEVVCDECPNVKLVNEERTLEVEIEPGVRDGMEYPFI 
GEGEPHVDGEPGDLRFRIKVVKHPIFERRGDDLYTNVTISLVESLVGFEMDITHLDGHKVHISRDKITRPGAKLW
KKGEGLPNFDNNNIKGSLIITFDVDFPKEQLTEEAREGIKQLLKQGSVQKVYNGLQGY 

Important features: 
Signal peptide: 

amino acids 1-22 

Cell attachment sequence. 

amino acids 254-257 

Nt-dnaJ domain signature. 

amino acids 67-87 

Homologous region to Nt-dnaJ domain proteins. 

amino acids 26-58 

N-glycosylation site. 

amino acids 5-9, 261-265 

Tyrosine kinase phosphorylation site. 

amino acids 253-260 

N-myristoylation site. 

amino acids 18-24, 31-37, 93-99, 215-221 



Amidation site.

amino acids 164-168 
FIGURE 417

CGGCGGCGGCTGCGGGCGCGAGGTGAGGGGCGCGAGGTGAGGGGCGCGAGGTTCCCAGCAGGA 
TGCCCCGGCTCTGCAGGAAGCTGAAGTGAGAGGCCCGGAGAGGGCCCAGCCCGCCCGGGGCAG 
GATGACCAAGGCCCGGCTGTTCCGGCTGTGGCTGGTGCTGGGGTCGGTGTTCATGATCCTGCT
GATCATCGTGTACTGGGACAGCGCAGGCGCCGCGCACTTCTACTTGCACACGTCCTTCTCTAG 
GCCGCACACGGGGCCGCCGCTGCCCACGCCCGGGCCGGACAGGGACAGGGAGCTCACGGCCGA 
CTCCGATGTCGACGAGTTTCTGGACAAGTTTCTCAGTGCTGGCGTGAAGCAGAGCGACCTTCC 
CAGAAAGGAGACGGAGCAGCCGCCTGCGCCGGGGAGCATGGAGGAGAGCGTGAGAGGCTACGA 
CTGGTCCCCGCGCGACGCCCGGCGCAGCCCAGACCAGGGCCGGCAGCAGGCGGAGCGGAGGAG 
CGTGCTGCGGGGCTTCTGCGCCAACTCCAGCCTGGCCTTCCCCACCAAGGAGCGCGCATTCGA 
CGACATCCCCAACTCGGAGCTGAGCCACCTGATCGTGGACGACCGGCACGGGGCCATCTACTG 
CTACGTGCCCAAGGTGGCCTGCACCAACTGGAAGCGCGTGATGATCGTGCTGAGCGGAAGCCT 
GCTGCACCGCGGTGCGCCCTACCGCGACCCGCTGCGCATCCCGCGCGAGCACGTGCACAACGC 
CAGCGCGCACCTGACCTTCAACAAGTTCTGGCGCCGCTACGGGAAGCTCTCCCGCCACCTCAT 
GAAGGTCAAGCTCAAGAAGTACACCAAGTTCCTCTTCGTGCGCGACCCCTTCGTGCGCCTGAT 
CTCCGCCTTCCGCAGCAAGTTCGAGCTGGAGAACGAGGAGTTCTACCGCAAGTTCGCCGTGCC 
CATGCTGCGGCTGTACGCCAACCACACCAGCCTGCCCGCCTCGGCGCGCGAGGCCTTCCGCGC 
TGGCCTCAAGGTGTCCTTCGCCAACTTCATCCAGTACCTGCTGGACCCGCACACGGAGAAGCT 
GGCGCCCTTCAACGAGCACTGGCGGCAGGTGTACCGCCTCTGCCACCCGTGCCAGATCGACTA 
CGACTTCGTGGGGAAGCTGGAGACTCTGGACGAGGACGCCGCGCAGCTGCTGCAGCTACTCCA 
GGTGGACCGGCAGCTCCGCTTCCCCCCGAGCTACCGGAACAGGACCGCCAGCAGCTGGGAGGA 
GGACTGGTTCGCCAAGATCCCCCTGGCCTGGAGGCAGCAGCTGTATAAACTCTACGAGGCCGA 
CTTTGTTCTCTTCGGCTACCCCAAGCCCGAAAACCTCCTCCGAGACTGAAAGCTTTCGCGTTG 
CTTTTTCTCGCGTGCCTGGAACCTGACGCACGCGCACTCCAGTTTTTTTATGACCTACGATTT 
TGCAATCTGGGCTTCTTGTTCACTCCACTGCCTCTATCCATTGAGTACTGTATCGATATTGTT 
TTTTAAGATTAATATATTTCAGGTATTTAATACGA 



WO 01/40466 



PCT/US00/32678 



418/550 

FIGURE 418 

MTKARLFRLWLVLGSVFMILLIIVYWDSAGAAHFYLHTSFSRPHTGPPLPTPGPDRDRELTAD 
SDVDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEESVRGYDWSPRDARRSPDQGRQQAERRS 
VLRGFCANSSLAFPTKERAFDDIPNSELSHLIVDDRHGAIYCYVPKVACTNWKRVMIVLSGSL 
LHRGAPYRDPLRIPREHVHNASAHLTFNKFWRRYGKLSRHLMKVKLKKYTKFLFVRDPFVRLI 
SAFRSKFELENEEFYRKFAVPMLRLYANHTSLPASAREAFRAGLKVSFANFIQYLLDPHTEKL 
APFNEHWRQVYRLCHPCQIDYDFVGKLETLDEDAAQLLQLLQVDRQLRFPPSYRNRTASSWEE 
DWFAKIPLAWRQQLYKLYEADFVLFGYPKPENLLRD

Important features:
Signal peptide: 

amino acids 1-31 

N-glycosylation sites.

amino acids 134-137, 209-212, 280-283 and 370-373 

TNFR/NGFR family cysteine-rich region protein

amino acids 329-332 
FIGURE 419

GGCACGAGGCTGAACCCAGCCGGCTCCATCTCAGCTTCTGGTTTCTAAGTCCATGTGCCAAAG 

GCTGCCAGGAAGGAGACGCCTTCCTGAGTCCTGGATCTTTCTTCCTTCTGGAAATCTTTGACT 

GTGGGTAGTTATTTATTTCTGAATAAGAGCGTCCACGCATC ATG GACCTCGCGGGACTGCTGA 

AGTCTCAGTTCCTGTGCCACCTGGTCTTCTGCTACGTCTTTATTGCCTCAGGGCTAATCATCA 

ACACCATTCAGCTCTTCACTCTCCTCCTCTGGCCCATTAACAAGCAGCTCTTCCGGAAGATCA 

ACTGCAGACTGTCCTATTGCATCTCAAGCCAGCTGGTGATGCTGCTGGAGTGGTGGTCGGGCA 

CGGAATGCACCATCTTCACGGACCCGCGCGCCTACCTCAAGTATGGGAAGGAAAATGCCATCG

TGGTTCTCAACCACAAGTTTGAAATTGACTTTCTGTGTGGCTGGAGCCTGTCCGAACGCTTTG 

GGCTGTTAGGGGGCTCCAAGGTCCTGGCCAAGAAAGAGCTGGCCTATGTCCCAATTATCGGCT

GGATGTGGTACTTCACCGAGATGGTCTTCTGTTCGCGCAAGTGGGAGCAGGATCGCAAGACGG

TTGCCACCAGTTTGCAGCACCTCCGGGACTACCCCGAGAAGTATTTTTTCCTGATTCACTGTG 

AGGGCACACGGTTCACGGAGAAGAAGCATGAGATCAGCATGCAGGTGGCCCGGGCCAAGGGGC 

TGCCTCGCCTCAAGCATCACCTGTTGCCACGAACCAAGGGCTTCGCCATCACCGTGAGGAGCT 

TGAGAAATGTAGTTTCAGCTGTATATGACTGTACACTCAATTTCAGAAATAATGAAAATCCAA 

CACTGCTGGGAGTCCTAAACGGAAAGAAATACCATGCAGATTTGTATGTTAGGAGGATCCCAC 

TGGAAGACATCCCTGAAGACGATGACGAGTGCTCGGCCTGGCTGCACAAGCTCTACCAGGAGA 

AGGATGCCTTTCAGGAGGAGTACTACAGGACGGGCACCTTCCCAGAGACGCCCATGGTGCCCC 

CCCGGCGGCCCTGGACCCTCGTGAACTGGCTGTTTTGGGCCTCGCTGGTGCTCTACCCTTTCT 

TCCAGTTCCTGGTCAGCATGATCAGGAGCGGGTCTTCCCTGACGCTGGCCAGCTTCATCCTCG 

TCTTCTTTGTGGCCTCCGTGGGAGTTCGATGGATGATTGGTGTGACGGAAATTGACAAGGGCT

CTGCCTACGGCAACTCTGACAGCAAGCAGAAACTGAATGACTGACTCAGGGAGGTGTCACCAT 

CCGAAGGGAACCTTGGGGAACTGGTGGCCTCTGCATATCCTCCTTAGTGGGACACGGTGACAA 

AGGCTGGGTGAGCCCCTGCTGGGCACGGCGGAAGTCACGACCTCTCCAGCCAGGGAGTCTGGT 

CTCAAGGCCGGATGGGGAGGAAGATGTTTTGTAATCTTTTTTTCCCCATGTGCTTTAGTGGGC 

TTTGGTTTTCTTTTTGTGCGAGTGTGTGTGAGAATGGCTGTGTGGTGAGTGTGAACTTTGTTC 

TGTGATCATAGAAAGGGTATTTTAGGCTGCAGGGGAGGGCAGGGCTGGGGACCGAAGGGGACA 

AGTTCCCCTTTCATCCTTTGGTGCTGAGTTTTCTGTAACCCTTGGTTGCCAGAGATAAAGTGA 

AAAGTGCTTTAGGTGAGATGACTAAATTATGCCTCCAAGAAAAAAAAATTAAAGTGCTTTTCT 

GGGTCAAAAAAAAAAAA 
FIGURE 420

MDLAGLLKSQFLCHLVFCYVFIASGLIINTIQLFTLLLWPINKQLFRKINCRLSYCISSQLVM 
LLEWWSGTECTIFTDPRAYLKYGKENAIVVLNHKFEIDFLCGWSLSERFGLLGGSKVLAKKEL 
AYVPIIGWMWYFTEMVFCSRKWEQDRKTVATSLQHLRDYPEKYFFLIHCEGTRFTEKKHEISM 
QVARAKGLPRLKHHLLPRTKGFAITVRSLRNVVSAVYDCTLNFRNNENPTLLGVLNGKKYHAD 
LYVRRIPLEDIPEDDDECSAWLHKLYQEKDAFQEEYYRTGTFPETPMVPPRRPWTLVNWLFWA 
SLVLYPFFQFLVSMIRSGSSLTLASFILVFFVASVGVRWMIGVTEIDKGSAYGNSDSKQKLND
FIGURE 421

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGGTGCCTGCATC 
GCCATGGACACCACCAGGTACAGCAAGTGGGGCGGCAGCTCCGAGGAGGTCCCCGGAGGGCCC 
TGGGGACGCTGGGTGCACTGGAGCAGGAGACCCCTCTTCTTGGCCCTGGCTGTCCTGGTCACC 
ACAGTCCTTTGGGCTGTGATTCTGAGTATCCTATTGTCCAAGGCCTCCACGGAGCGCGCGGCG 
CTGCTTGACGGCCACGACCTGCTGAGGACAAACGCCTCGAAGCAGACGGCGGCGCTGGGTGCC 
CTGAAGGAGGAGGTCGGAGACTGCCACAGCTGCTGCTCGGGGACGCAGGCGCAGCTGCAGACC 
ACGCGCGCGGAGCTTGGGGAGGCGCAGGCGAAGCTGATGGAGCAGGAGAGCGCCCTGCGGGAA 
CTGCGTGAGCGCGTGACCCAGGGCTTGGCTGAAGCCGGCAGGGGCCGTGAGGACGTCCGCACT 
GAGCTGTTCCGGGCGCTGGAGGCCGTGAGGCTCCAGAACAACTCCTGCGAGCCGTGCCCCACG 
TCGTGGCTGTCCTTCGAGGGCTCCTGCTACTTTTTCTCTGTGCCAAAGACGACGTGGGCGGCG 
GGGCAGGATCACTGCGCAGATGCCAGCGCGCACCTGGTGATCGTTGGGGGCCTGGATGAGCAG 
GGCTTCCTCACTCGGAACACGCGTGGCCGTGGTTACTGGCTGGGCCTGAGGGCTGTGCGCCAT 
CTGGGCAAGGTTCAGGGCTACCAGTGGGTGGACGGAGTCTCTCTCAGCTTCAGCCACTGGAAC 
CAGGGAGAGCCCAATGACGCTTGGGGGCGCGAGAACTGTGTCATGATGCTGCACACGGGGCTG 
TGGAACGACGCACCGTGTGACAGCGAGAAGGACGGCTGGATCTGTGAGAAAAGGCACAACTGC 
TGACCCCGCCCAGTGCCCTGGAGCCGCGCCCATTGCAGCATGTCGTATCCTGGGGGCTGCTCA 
CCTCCCTGGCTCCTGGAGCTGATTGCCAAAGAGTTTTTTTCTTCCTCATCCACCGCTGCTGAG 
TCTCAGAAACACTTGGCCCAACATAGCCCTGTCCAGCCCAGTGCCTGGGCTCTGGGACCTCCA 
TGCCGACCTCATCCTAACTCCACTCACGCAGACCCAACCTAACCTCCACTAGCTCCAAAATCC 
CTGCTCCTGCGTCCCCGTGATATGCCTCCACTTCTCTCCCTAACCAAGGTTAGGTGACTGAGG 
ACTGGAGCTGTTTGGTTTTCTCGCATTTTCCACCAAACTGGAAGCTGTTTTTGCAGCCTGAGG 
AAGCATCAATAAATATTTGAGAAATGAAAAAA 
FIGURE 422

MDTTRYSKWGGSSEEVPGGPWGRWVHWSRRPLFLALAVLVTTVLWAVILSILLSKASTERAAL 
LDGHDLLRTNASKQTAALGALKEEVGDCHSCCSGTQAQLQTTRAELGEAQAKLMEQESALREL 
RERVTQGLAEAGRGREDVRTELFRALEAVRLQNNSCEPCPTSWLSFEGSCYFFSVPKTTWAAA 
QDHCADASAHLVIVGGLDEQGFLTRNTRGRGYWLGLRAVRHLGKVQGYQWVDGVSLSFSHWNQ 
GEPNDAWGRENCVMMLHTGLWNDAPCDSEKDGWICEKRHNC 

Important features: 

Type II transmembrane domain: 

amino acids 31-54 

N-glycosylation sites. 

amino acids 73-76 and 159-162 

Leucine zipper pattern. 

amino acids 102-123 

N-myristoylation sites. 

amino acids 18-23, 133-138 and 242-247 

C-type lectin domain signature. 

amino acids 264-287 
FIGURE 423

GCGCCGCCAGGCGTAGGCGGGGTGGCCCTTGCGTCTCCCGCTTCCTTGAAAAACCCGGCGGGC 
GAGCGAGGCTGCGGGCCGGCCGCTGCCCTTCCCCACACTCCCCGCCGAGAAGCCTCGCTCGGC 
GCCCAACATGGCGGGTGGGCGCTGCGGCCCGCAGCTAACGGCGCTCCTGGCCGCCTGGATCGC 
GGCTGTGGCGGCGACGGCAGGCCCCGAGGAGGCCGCGCTGCCGCCGGAGCAGAGCCGGGTCCA 
GCCCATGACCGCCTCCAACTGGACGCTGGTGATGGAGGGCGAGTGGATGCTGAAATTTTACGC 
CCCATGGTGTCCATCCTGCCAGCAGACTGATTCAGAATGGGAGGCTTTTGCAAAGAATGGTGA 
AATACTTCAGATCAGTGTGGGGAAGGTAGATGTCATTCAAGAACCAGGTTTGAGTGGCCGCTT 
CTTTGTCACCACTCTCCCAGCATTTTTTCATGCAAAGGATGGGATATTGCGCCGTTATCGTGG 
CCCAGGAATCTTCGAAGACCTGCAGAATTATATCTTAGAGAAGAAATGGCAATCAGTCGAGCC 
TCTGACTGGCTGGAAATCCCCAGCTTCTCTAACGATGTCTGGAATGGCTGGTCTTTTTAGCAT 
CTCTGGCAAGATATGGCATCTTCACAACTATTTCACAGTGACTCTTGGAATTCCTGCTTGGTG 
TTCTTATGTGTTTTTCGTCATAGCCACCTTGGTTTTTGGCCTTTTTATGGGTCTGGTCTTGGT
GGTAATATCAGAATGTTTCTATGTGCCACTTCCAAGGCATTTATCTGAGCGTTCTGAGCAGAA 
TCGGAGATCAGAGGAGGCTCATAGAGCTGAACAGTTGCAGGATGCGGAGGAGGAAAAAGATGA 
TTCAAATGAAGAAGAAAACAAAGACAGCCTTGTAGATGATGAAGAAGAGAAAGAAGATCTTGG 
CGATGAGGATGAAGCAGAGGAAGAAGAGGAGGAGGACAACTTGGCTGCTGGTGTGGATGAGGA 
GAGAAGTGAGGCCAATGATCAGGGGCCCCCAGGAGAGGACGGTGTGACCCGGGAGGAAGTAGA 
GCCTGAGGAGGCTGAAGAAGGCATCTCTGAGCAACCCTGCCCAGCTGACACAGAGGTGGTGGA 
AGACTCCTTGAGGCAGCGTAAAAGTCAGCATGCTGACAAGGGACTGTAGATTTAATGATGCGT 
TTTCAAGAATACACACCAAAACAATATGTCAGCTTCCCTTTGGCCTGCAGTTTGTACCAAATC 
CTTAATTTTTCCTGAATGAGCAAGCTTCTCTTAAAAGATGCTCTCTAGTCATTTGGTCTCATG 
GCAGTAAGCCTCATGTATACTAAGGAGAGTCTTCCAGGTGTGACAATCAGGATATAGAAAAAC 
AAACGTAGTGTTGGGATCTGTTTGGAGACTGGGATGGGAACAAGTTCATTTACTTAGGGGTCA 
GAGAGTCTCGACCAGAGGAGGCCATTCCCAGTCCTAATCAGCACCTTCCAGAGACAAGGCTGC 
AGGCCCTGTGAAATGAAAGCCAAGCAGGAGCCTTGGCTGCTGAGCATCCCCAAAGTGTAACGT 
AGAAGCCTTGCATCCTTTTCTTGTGTAAAGTATTTATTTTTGTCAAATTGCAGGAAACATCAG
GCACCACAGTGCATGAAAAATCTTTCACAGCTAGAAATTGAAAGGGCCTTGGGTATAGAGAGC 
AGCTCAGAAGTCATCCCAGCCCTCTGAATCTCCTGTGCTATGTTTTATTTCTTACCTTTAATT 
TTTCCAGCATTTCCACCATGGGCATTCAGGCTCTCCACACTCTTCACTATTATCTCTTGGTCA 
GAGGACTCCAATAACAGCCAGGTTTACATGAACTGTGTTTGTTCATTCTGACCTAAGGGGTTT 
AGATAATCAGTAACCATAACCCCTGAAGCTGTGACTGCCAAACATCTCAAATGAAATGTTGTG 
GCCATCAGAGACTCAAAAGGAAGTAAGGATTTTACAAGACAGATTAAAAAAAAATTGTTTTGT 
CCAAAATATAGTTGTTGTTGATTTTTTTTTAAGTTTTCTAAGCAATATTTTTCAAGCCAGAAG 
TCCTCTAAGTCTTGCCAGTACAAGGTAGTCTTGTGAAGAAAAGTTGAATACTGTTTTGTTTTC 
ATCTCAAGGGGTTCCCTGGGTCTTGAACTACTTTAATAATAACTAAAT^AACCACTTCTGATTT 
TCCTTCAGTGATGTGCTTTTGGTGAAAGAATTAATGAACTCCAGTACCTGAAAGTGAAAGATT 
TGATTTTGTTTCCATCTTCTGTAATCTTCCAAAGAATTATATCTTTGTAAATCTCTCAATACT 
CAATCTACTGTAAGTACCCAGGGAGGCTAATTTCTTT 
FIGURE 424

MAGGRCGPQLTALLAAWIAAVAATAGPEEAALPPEQSRVQPMTASNWTLVMEGEWMLKFYAPW 
CPSCQQTDSEWEAFAKNGEILQISVGKVDVIQEPGLSGRFFVTTLPAFFHAKDGIFRRYRGPG 
IFEDLQNYILEKKWQSVEPLTGWKSPASLTMSGMAGLFSISGKIWHLHNYFTVTLGIPAWCSY 
VFFVIATLVFGLFMGLVLVVISECFYVPLPRHLSERSEQNRRSEEAHRAEQLQDAEEEKDDSN
EEENKDSLVDDEEEKEDLGDEDEAEEEEEEDNLAAGVDEERSEANDQGPPGEDGVTREEVEPE 
EAEEGISEQPCPADTEVVEDSLRQRKSQHADKGL



Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 191-211 

N-glycosylation site. 

amino acids 46-49

Thioredoxin family proteins (homologous region to disulfide isomerase).

amino acids 56-72

Flavodoxin proteins

amino acids 173-187
FIGURE 425

GAGGAACCTACCGGTACCGGCCGCGCGCTGGTAGTCGCCGGTGTGGCTGCACCTCACCAATCCCGTGCGCCGCGG 
CTGGGCCGTCGGAGAGTGCGTGTGCTTCTCTCCTGCACGCGGTGCTTGGGCTCGGCCAGGCGGGGTCCGCCGCCA 
GGGTTTGAGGATGGGGGAGTAGCTACAGGAAGCGACCCCGCGATGGCAAGGTATATTTTTGTGGAATGAAAAGGA 
AGTATTAGAAATGAGCTGAAGACCATTCACAGATTAATATTTTTGGGGACAGATTTGTGATGCTTGATTCACCCT 
TGAAGTAATGTAGACAGAAGTTCTCAAATTTGCATATTACATCAACTGGAACCAGCAGTGAATCTTAATGTTCAC 
TTAAATCAGAACTTGCATAAGAAAGAGAATCGGAGTCTGGTTAAATAAAGATGACTATATCAGAGACTTGAAAAG 
GATCATTCTCTGTTTTCTGATAGTGTATATGGCCATTTTAGTGGGCACAGATCAGGATTTTTACAGTTTACTTGG 
AGTGTCCAAAACTGCAAGCAGTAGAGAAATAAGACAAGCTTTCAAGAAATTGGCATTGAAGTTACATCCTGATAA 
AAACCCGAATAACCCAAATGCACATGGCGATTTTTTAAAAATAAATAGAGCATATGAAGTACTCAAAGATGAAGA 
TCTACGGAAAAAGTATGACAAATATGGAGAAAAGGGACTTGAGGATAATCAAGGTGGCCAGTATGAAAGCTGGAA 
CTATTATCGTTATGATTTTGGTATTTATGATGATGATCCTGAAATCATAACATTGGAAAGAAGAGAATTTGATGC 
TGCTGTTAATTCTGGAGAACTGTGGTTTGTAAATTTTTACTCCCCAGGCTGTTCACACTGCCATGATTTAGCTCC 
CACATGGAGAGACTTTGCTAAAGAAGTGGATGGGTTACTTCGAATTGGAGCTGTTAACTGTGGTGATGATAGAAT 
GCTTTGCCGAATGAAAGGAGTCAACAGCTATCCCAGTCTCTTCATTTTTCGGTCTGGAATGGCCCCAGTGAAATA 
TCATGGAGACAGATCAAAGGAGAGTTTAGTGAGTTTTGCAATGCAGCATGTTAGAAGTACAGTGACAGAACTTTG 
GACAGGAAATTTTGTCAACTCCATACAAACTGCTTTTGCTGCTGGTATTGGCTGGCTGATCACTTTTTGTTCAAA 
AGGAGGAGATTGTTTGACTTCACAGACACGACTCAGGCTTAGTGGCATGTTGTTTCTCAACTCATTGGATGCTAA 
AGAAATATATTTGGAAGTAATACATAATCTTCCAGATTTTGAACTACTTTCGGCAAACACACTAGAGGATCGTTT 
GGCTCATCATCGGTGGCTGTTATTTTTTCATTTTGGAAAAAATGAAAATTCAAATGATCCTGAGCTGAAAAAACT 
AAAAACTCTACTTAAAAATGATCATATTCAAGTTGGCAGGTTTGACTGTTCCTCTGCACCAGACATCTGTAGTAA 
TCTGTATGTTTTTCAGCCGTCTCTAGCAGTATTTAAAGGACAAGGAACCAAAGAATATGAAATTCATCATGGAAA 
GAAGATTCTATATGATATACTTGCCTTTGCCAAAGAAAGTGTGAATTCTCATGTTACCACGCTTGGACCTCAAAA 
TTTTCCTGCCAATGACAAAGAACCATGGCTTGTTGATTTCTTTGCCCCCTGGTGTCCACCATGTCGAGCTTTACT 
ACCAGAGTTACGAAGAGCATCAAATCTTCTTTATGGTCAGCTTAAGTTTGGTACACTAGATTGTACAGTTCATGA 
GGGACTCTGTAACATGTATAACATTCAGGCTTATCCAACAACAGTGGTATTCAACCAGTCCAACATTCATGAGTA 
TGAAGGACATCACTCTGCTGAACAAATCTTGGAGTTCATAGAGGATCTTATGAATCCTTCAGTGGTCTCCCTTAC
ACCCACCACCTTCAACGAACTAGTTACACAAAGAAAACACAACGAAGTCTGGATGGTTGATTTCTATTCTCCGTG 
GTGTCATCCTTGCCAAGTCTTAATGCCAGAATGGAAAAGAATGGCCCGGACATTAACTGGACTGATCAACGTGGG 
CAGTATAGATTGCCAACAGTATCATTCTTTTTGTGCCCAGGAAAACGTTCAAAGATACCCTGAGATAAGATTTTT 
TCCCCCAAAATCAAATAAAGCTTATCAGTATCACAGTTACAATGGTTGGAATAGGGATGCTTATTCCCTGAGAAT 
CTGGGGTCTAGGATTTTTACCTCAAGTATCCACAGATCTAACACCTCAGACTTTCAGTGAAAAAGTTCTACAAGG 
GAAAAATCATTGGGTGATTGATTTCTATGCTCCTTGGTGTGGACCTTGCCAGAATTTTGCTCCAGAATTTGAGCT 
CTTGGCTAGGATGATTAAAGGAAAAGTGAAAGCTGGAAAAGTAGACTGTCAGGCTTATGCTCAGACATGCCAGAA 
AGCTGGGATCAGGGCCTATCCAACTGTTAAGTTTTATTTCTACGAAAGAGCAAAGAGAAATTTTCAAGAAGAGCA 
GATAAATACCAGAGATGCAAAAGCAATCGCTGCCTTAATAAGTGAAAAATTGGAAACTCTCCGAAATCAAGGCAA 
GAGGAATAAGGATGAACTTTGATAATGTTGAAGATGAAGAAAAAGTTTAAAAGAAATTCTGACAGATGACATCAG 
AAGACACCTATTTAGAATGTTACATTTATGATGGGAATGAATGAACATTATCTTAGACTTGCAGTTGTACTGCCA 
GAATTATCTACAGCACTGGTGTAAAAGAAGGGTCTGCAAACTTTTTCTGTAAAGGGCCGGTTTATAAATATTTTA 
GACTTTGCAGGCTATAATATATGGTTCACACATGAGAACAAGAATAGAGTCATCATGTATTCTTTGTTATTTGCT 
TTTAACAACCTTTAAAAAATATTAAAACGATTCTTAGCTCAGAGCCATACAAAAGTAGGCTGGATTCAGTCCATG 
GACCATAGATTGCTGTCCCCCTCGACGGACTTATAATGTTTCAGGTGGCTGGCTTGAACATGAGTCTGCTGTGCT 
ATCTACATAAATGTCTAAGTTGTATAAAGTCCACTTTCCCTTCACGTTTTTTGGCTGACCTGAAAAGAGGTAACT 
TAGTTTTTGGTCACTTGTTCTCCTAAAAATGCTATCCCTAACCATATATTTATATTTCGTTTTAAAAACACCCAT 
GATGTGGCACAGTAAACAAACCCTGTTATGCTGTATTATTATGAGGAGATTCTTCATTGTTTTCTTTCCTTCTCA 
AAGGTTGAAAAAATGCTTTTAATTTTTCACAGCCGAGAAACAGTGCAGCAGTATATGTGCACACAGTAAGTACAC 
AAATTTGAGCAACAGTAAGTGCACAAATTCTGTAGTTTGCTGTATCATCCAGGAAAACCTGAGGGAAAAAAATTA
TAGCAATTAACTGGGCATTGTAGAGTATCCTAAATATGTTATCAAGTATTTAGAGTTCTATATTTTAAAGATATA 
TGTGTTCATGTATTTTCTGAAATTGCTTTCATAGAAATTTTCCCACTGATAGTTGATTTTTGAGGCATCTAATAT 
TTACATATTTGCCTTCTGAACTTTGTTTTGACCTGTATCCTTTATTTACATTGGGTTTTTCTTTCATAGTTTTGG 
TTTTTCACTCCTGTCCAGTCTATTTATTATTCAAATAGGAAAAATTACTTTACAGGTTGTTTTACTGTAGCTTAT 
AATGATACTGTAGTTATTCCAGTTACTAGTTTACTGTCAGAGGGCTGCCTTTTTCAGATAAATATTGACATAATA
ACTGAAGTTATTTTTATAAGAAAATCAAGTATATAAATCTAGGAAAGGGATCTTCTAGTTTCTGTGTTGTTTAGA 
CTCAAAGAATCACAAATTTGTCAGTAACATGTAGTTGTTTAGTTATAATTCAGAGTGTACAGAATGGTAAAAATT 
CCAATCAGTCAAAAGAGGTCAATGAATTAAAAGGCTTGCAACTTTTTCAAAAAAAAAAAAAAAAA 
FIGURE 426

MGVWLNKDDYIRDLKRIILCFLIVYMAILVGTDQDFYSLLGVSKTASSREIRQAFKKLALKLH 
PDKNPNNPNAHGDFLKINRAYEVLKDEDLRKKYDKYGEKGLEDNQGGQYESWNYYRYDFGIYD 
DDPEIITLERREFDAAVNSGELWFVNFYSPGCSHCHDLAPTWRDFAKEVDGLLRIGAVNCGDD 
RMLCRMKGVNSYPSLFIFRSGMAPVKYHGDRSKESLVSFAMQHVRSTVTELWTGNFVNSIQTA 
FAAGIGWLITFCSKGGDCLTSQTRLRLSGMLFLNSLDAKEIYLEVIHNLPDFELLSANTLEDR 
LAHHRWLLFFHFGKNENSNDPELKKLKTLLKNDHIQVGRFDCSSAPDICSNLYVFQPSLAVFK 
GQGTKEYEIHHGKKILYDILAFAKESVNSHVTTLGPQNFPANDKEPWLVDFFAPWCPPCRALL 
PELRRASNLLYGQLKFGTLDCTVHEGLCNMYNIQAYPTTVVFNQSNIHEYEGHHSAEQILEFI
EDLMNPSVVSLTPTTFNELVTQRKHNEVWMVDFYSPWCHPCQVLMPEWKRMARTLTGLINVGS
IDCQQYHSFCAQENVQRYPEIRFFPPKSNKAYQYHSYNGWNRDAYSLRIWGLGFLPQVSTDLT 
TQTFSEKVLQGKNHWVIDFYAPWCGPCQNFAPEFELLARMIKGKVKAGKVDCQAYAQTCQKAG 
IRAYPTVKFYFYERAKRNFQEEQINTRDAKAIAALISEKLETLRNQGKRNKDEL

Important features:

Endoplasmic reticulum targeting sequence. 

amino acids 744-747

Cytochrome c family heme-binding site signature. 

amino acids 158-163 

Nt-dnaJ domain signature. 

amino acids 77-96 



N-glycosylation site. 

amino acids 484-487 
FIGURE 427

CTGCAGTCAGGACTCTGGGACCGCAGGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGCA
CGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTTGA 
GTCCTTCTGAGATGATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGGTAG 
CGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTCAATT 
CCAACGCTATCAAGAACCTGCCCCCAGCGCTGGGCGGCGCTGCGGGGCACCCAGGCTCTGCAG 
TCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACAACTACC 
AGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGTCCCACCC 
GCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACGCTGCATGC 
GTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTCTTCTGATCAAA 
ATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGATCATAGCACCT 
TGGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAAAGGACAAGAAG 
GTTCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTAGACACTTCTGGT 
CCAAGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCATAGGAGAAAAGGCT 
CTCATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTCTTGCCGGATACAGA 
AAGATCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGAGACACTA&ACCAGCT 
ATCCAAATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACCTTTTATGACCTTCATC 
AACTCAATCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCATTCCAATAACACCTTCCA 
AAAACCTGGAGTGTAAGAGCTTTGTTTCTTTATGGAACTCCCCTGTGATTGCAGTAAATTACT 
GTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAATGCAATGAAAGTTTTAATTATTTTTCT 
AAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTTTTGTACACATTGATTGTTAT 
CTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATTTCAGCTTATAGTTCTTAAAAG 
CATAACCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCAAGGATCTCTTGGAATGACAAAT 
GATAGGTACCTAAAATGTAACATGAAAATACTAGCTTATTTTCTGAAATGTACTATCTTAATG 
CTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGAAATAAAATTTAACATTTAAAAAAA 
AAAAAA 
FIGURE 428

MMALGAAGATRVFVAMVAAALGGHPLLGVSATLNSVLNSNAIKNLPPPLGGAAGHPGSAVSAA 
PGILYPGGNKYQTIDNYQPYPCAEDEECGTDEYCASPTRGGDAGVQICLACRKRRKRCMRHAM 
CCPGNYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMYHTKGQEGSVC 
LRSSDCASGLCCARHFWSKICKPVLKEGQVCTKHRRKGSHGLEIFQRCYCGEGLSCRIQKDHH 
QASNSSRLHTCQRH 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 256-259 

Fungal Zn(2)-Cys(6) binuclear cluster domain 

amino acids 110-126 
FIGURE 429

gagaggacgaggtgccgctgcctggagaatcctccgctgccgtcggctcccggagcccagccc 
tttcctaacccaacccaacctagcccagtcccagccgccagcgcctgtccctgtcacggaccc 
cagcgttaccatgcatcctgccgtcttcctatccttacccgacctcagatgctcccttctgct 
cctggtaacttgggtttttactcctgtaacaactgaaataacaagtcttgctacagagaatat 
agatgaaattttaaacaatgctgatgttgctttagtaaatttttatgctgactggtgtcgttt 
cagtcagatgttgcatccaatttttgaggaagcttccgatgtcattaaggaagaatttccaaa 
tgaaaatcaagtagtgtttgccagagttgattgtgatcagcactctgacatagcccagagata 
caggataagcaaatacccaaccctcaaattgtttcgtaatgggatgatgatgaagagagaata 
caggggtcagcgatcagtgaaagcattggcagattacatcaggcaacaaaaaagtgaccccat 
tcaagaaattcgggacttagcagaaatcaccactcttgatcgcagcaaaagaaatatcattgg 
atattttgagcaaaaggactcggacaactatagagtttttgaacgagtagcgaatattttgca 
tgatgactgtgcctttctttctgcatttggggatgtttcaaaaccggaaagatatagtggcga 
caacataatctacaaaccaccagggcattctgctccggatatggtgtacttgggagctatgac 
aaattttgatgtgacttacaattggattcaagataaatgtgttcctcttgtccgagaaataac
atttgaaaatggagaggaattgacagaagaaggactgccttttctcatactctttcacatgaa 
agaagatacagaaagtttagaaatattccagaatgaagtagctcggcaattaataagtgaaaa 
aggtacaataaactttttacatgccgattgtgacaaatttagacatcctcttctgcacataca 
gaaaactccagcagattgtcctgtaatcgctattgacagctttaggcatatgtatgtgtttgg 
agacttcaaagatgtattaattcctggaaaactcaagcaattcgtatttgacttacattctgg 
aaaactgcacagagaattccatcatggacctgacccaactgatacagccccaggagagcaagc 
ccaagatgtagcaagcagtccacctgagagctccttccagaaactagcacccagtgaatatag 
gtatactctattgagggatcgagatgagctttaaaaacttgaaaaacagtttgtaagcctttc 
aacagcagcatcaacctacgtggtggaaatagtaaacctatattttcataattctatgtgtat 
ttttattttgaataaacagaaagaaatttaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaaaaaaaa 
FIGURE 430

MHPAVFLSLPDLRCSLLLLVTWVFTPVTTEITSLATENIDEILNNADVALVNFYADWCRFSQM 
LHPIFEEASDVIKEEFPNENQVVFARVDCDQHSDIAQRYRISKYPTLKLFRNGMMMKREYRGQ 
RSVKALADYIRQQKSDPIQEIRDLAEITTLDRSKRNIIGYFEQKDSDNYRVFERVANILHDDC 
AFLSAFGDVSKPERYSGDNIIYKPPGHSAPDMVYLGAMTNFDVTYNWIQDKCVPLVREITFEN 
GEELTEEGLPFLILFHMKEDTESLEIFQNEVARQLISEKGTINFLHADCDKFRHPLLHIQKTP 
ADCPVIAIDSFRHMYVFGDFKDVLIPGKLKQFVFDLHSGKLHREFHHGPDPTDTAPGEQAQDV 
ASSPPESSFQKLAPSEYRYTLLRDRDEL 

Important features: 
Signal peptide: 

amino acids 1-29 

Endoplasmic reticulum targeting sequence. 

amino acids 403-406 

Tyrosine kinase phosphorylation site. 

amino acids 203-211 

Thioredoxin family proteins 

amino acids 50-66 
FIGURE 431

GAGCAGGACGGAGCCATGGACCCCGCCAGGAAAGCAGGTGCCCAGGCCATGATCTGGACTGCA 
GGCTGGCTGCTGCTGCTGCTGCTTCGCGGAGGAGCGCAGGCCCTGGAGTGCTACAGCTGCGTG
CAGAAAGCAGATGACGGATGCTCCCCGAACAAGATGAAGACAGTGAAGTGCGCGCCGGGCGTG 
GACGTCTGCACCGAGGCCGTGGGGGCGGTGGAGACCATCCACGGACAATTCTCGCTGGCAGTG 
CGGGGTTGCGGTTCGGGACTCCCCGGGAAGAATGACCGCGGCCTGGATCTTCACGGGCTTCTG 
GCGTTCATCCAGCTGCAGCAATGCGCTCAGGATCGCTGCAACGCCAAGCTCAACCTCACCTCG 
CGGGCGCTCGACCCGGCAGGTAATGAGAGTGCATACCCGCCCAACGGCGTGGAGTGCTACAGC 
TGTGTGGGCCTGAGCCGGGAGGCGTGCCAGGGTACATCGCCGCCGGTCGTGAGCTGCTACAAC 
GCCAGCGATCATGTCTACAAGGGCTGCTTCGACGGCAACGTCACCTTGACGGCAGCTAATGTG 
ACTGTGTCCTTGCCTGTCCGGGGCTGTGTCCAGGATGAATTCTGCACTCGGGATGGAGTAACA 
GGCCCAGGGTTCACGCTCAGTGGCTCCTGTTGCCAGGGGTCCCGCTGTAACTCTGACCTCCGC 
AACAAGACCTACTTCTCCCCTCGAATCCCACCCCTTGTCCGGCTGCCCCCTCCAGAGCCCACG 
ACTGTGGCCTCAACCACATCTGTCACCACTTCTACCTCGGCCCCAGTGAGACCCACATCCACC 
ACCAAACCCATGCCAGCGCCAACCAGTCAGACTCCGAGACAGGGAGTAGAACACGAGGCCTCC 
CGGGATGAGGAGCCCAGGTTGACTGGAGGCGCCGCTGGCCACCAGGACCGCAGCAATTCAGGG 
CAGTATCCTGCAAAAGGGGGGCCCCAGCAGCCCCATAATAAAGGCTGTGTGGCTCCCACAGCT 
GGATTGGCAGCCCTTCTGTTGGCCGTGGCTGCTGGTGTCCTACTGTGAGCTTCTCCACCTGGA 
AATTTCCCTCTCACCTACTTCTCTGGCCCTGGGTACCCCTCTTCTCATCACTTCCTGTTCCCA
CCACTGGACTGGGCTGGCCCAGCCCCTGTTTTTCCAACATTCCCCAGTATCCCCAGCTTCTGC 
TGCGCTGGTTTGCGGCTTTGGGAAATAAAATACCGTTGTATATATTCTGCCAGGGGTGTTCTA 
GCTTTTTGAGGACAGCTCCTGTATCCTTCTCATCCTTGTCTCTCCGCTTGTCCTCTTGTGATG 
TTAGGACAGAGTGAGAGAAGTCAGCTGTCACGGGGAAGGTGAGAGAGAGGATGCTAAGCTTCC 
TACTCACTTTCTCCTAGCCAGCCTGGACTTTGGAGCGTGGGGTGGGTGGGACAATGGCTCCCC 
ACTCTAAGCACTGCCTCCCCTACTCCCCGCATCTTTGGGGAATCGGTTCCCCATATGTCTTCC 
TTACTAGACTGTGAGCTCCTCGAGGGGGGGCCCGGTACCCAATTCGCCCTATAGTGAGTCGTA 
FIGURE 432

MDPARKAGAQAMIWTAGWLLLLLLRGGAQALECYSCVQKADDGCSPNKMKTVKCAPGVDVCTE 
AVGAVETIHGQFSLAVRGCGSGLPGKNDRGLDLHGLLAFIQLQQCAQDRCNAKLNLTSRALDP 
AGNESAYPPNGVECYSCVGLSREACQGTSPPVVSCYNASDHVYKGCFDGNVTLTAANVTVSLP 
VRGCVQDEFCTRDGVTGPGFTLSGSCCQGSRCNSDLRNKTYFSPRIPPLVRLPPPEPTTVAST 
TSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGAAGHQDRSNSGQYPAK 
GGPQQPHNKGCVAPTAGLAALLLAVAAGVLL 
FIGURE 433

CGGGACTCGGCGGGTCCTCCTGGGAGTCTCGGAGGGGACCGGCTGTGCAGACGCCATGGAGTT 
GGTGCTGGTCTTCCTCTGCAGCCTGCTGGCCCCCATGGTCCTGGCCAGTGCAGCTGAAAAGGA 
GAAGGAAATGGACCCTTTTCATTATGATTACCAGACCCTGAGGATTGGGGGACTGGTGTTCGC 
TGTGGTCCTCTTCTCGGTTGGGATCCTCCTTATCCTAAGTCGCAGGTGCAAGTGCAGTTTCAA 
TCAGAAGCCCCGGGCCCCAGGAGATGAGGAAGCCCAGGTGGAGAACCTCATCACCGCCAATGC 
AACAGAGCCCCAGAAGCAGAGAACTGAAGTGCAGCCATCAGGTGGAAGCCTCTGGAACCTGAG 
GCGGCTGCTTGAACCTTTGGATGCAAATGTCGATGCTTAAGAAAACCGGCCACTTCAGCAACA 
GCCCTTTCCCCAGGAGAAGCCAAGAACTTGTGTGTCCCCCACCCTATCCCCTCTAACACCATT 
CCTCCACCTGATGATGCAACTAACACTTGCCTCCCCACTGCAGCCTGCGGTCCTGCCCACCTC 
CCGTGATGTGTGTGTGTGTGTGTGTGTGTGACTGTGTGTGTTTGCTAACTGTGGTCTTTGTGG 
CTACTTGTTTGTGGATGGTATTGTGTTTGTTAGTGAACTGTGGACTCGCTTTCCCAGGCAGGG 
GCTGAGCCACATGGCCATCTGCTCCTCCCTGCCCCCGTGGCCCTCCATCACCTTCTGCTCCTA 
GGAGGCTGCTTGTTGCCCGAGACCAGCCCCCTCCCCTGATTTAGGGATGCGTAGGGTAAGAGC 
ACGGGCAGTGGTCTTCAGTCGTCTTGGGACCTGGGAAGGTTTGCAGCACTTTGTCATCATTCT 
TCATGGACTCCTTTCACTCCTTTAACAAAAACCTTGCTTCCTTATCCCACCTGATCCCAGTCT 
GAAGGTCTCTTAGCAACTGGAGATACAAAGCAAGGAGCTGGTGAGCCCAGCGTTGACGTCAGG
CAGGCTATGCCCTTCCGTGGTTAATTTCTTCCCAGGGGCTTCCACGAGGAGTCCCCATCTGCC 
CCGCCCCTTCACAGAGCGCCCGGGGATTCCAGGCCCAGGGCTTCTACTCTGCCCCTGGGGAAT 
GTGTCCCCTGCATATCTTCTCAGCAATAACTCCATGGGCTCTGGGACCCTACCCCTTCCAACC 
TTCCCTGCTTCTGAGACTTCAATCTACAGCCCAGCTCATCCAGATGCAGACTACAGTCCCTGC 
AATTGGGTCTCTGGCAGGCAATAGTTGAAGGACTCCTGTTCCGTTGGGGCCAGCACACCGGGA 
TGGATGGAGGGAGAGCAGAGGCCTTTGCTTCTCTGCCTACGTCCCCTTAGATGGGCAGCAGAG 
GCAACTCCCGCATCCTTTGCTCTGCCTGTCGGTGGTCAGAGCGGTGAGCGAGGTGGGTTGGAG 
ACTCAGCAGGCTCCGTGCAGCCCTTGGGAACAGTGAGAGGTTGAAGGTCATAACGAGAGTGGG 
AACTCAACCCAGATCCCGCCCCTCCTGTCCTCTGTGTTCCCGCGGAAACCAACCAAACCGTGC 
GCTGTGACCCATTGCTGTTCTCTGTATCGTGATCTATCCTCAACAACAACAGAAAAAAGGAAT
AAAATATCCTTTGTTTCCT 
FIGURE 434

MELVLVFLCSLLAPMVLASAAEKEKEMDPFHYDYQTLRIGGLVFAVVLFSVGILLILSRRCKC
SFNQKPRAPGDEEAQVENLITANATEPQKQRTEVQPSGGSLWNLRRLLEPLDANVDA 
FIGURE 435

GGTCCTTAATGGCAGCAGCCGCCGCTACCAAGATCCTTCTGTGCCTCCCGCTTCTGCTCCTGC 
TGTCCGGCTGGTCCCGGGCTGGGCGAGCCGACCCTCACTCTCTTTGCTATGACATCACCGTCA 
TCCCTAAGTTCAGACCTGGACCACGGTGGTGTGCGGTTCAAGGCCAGGTGGATGAAAAGACTT 
TTCTTCACTATGACTGTGGCAACAAGACAGTCACACCTGTCAGTCCCCTGGGGAAGAAACTAA 
ATGTCACAACGGCCTGGAAAGCACAGAACCCAGTACTGAGAGAGGTGGTGGACATACTTACAG 
AGCAACTGCGTGACATTCAGCTGGAGAATTACACACCCAAGGAACCCCTCACCCTGCAGGCAA 
GGATGTCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGATCTTGGCAGTTCAGTTTCGATG 
GGCAGATCTTCCTCCTCTTTGACTCAGAGAAGAGAATGTGGACAACGGTTCATCCTGGAGCCA 
GAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTGGCCATGTCCTTCCATTACTTCTCAA 
TGGGAGACTGTATAGGATGGCTTGAGGACTTCTTGATGGGCATGGACAGCACCCTGGAGCCAA 
GTGCAGGAGCACCACTCGCCATGTCCTCAGGCACAACCCAACTCAGGGCCACAGCCACCACCC 
TCATCCTTTGCTGCCTCCTCATCATCCTCCCCTGCTTCATCCTCCCTGGCATCTGAGGAGAGT 
CCTTTAGAGTGACAGGTTAAAGCTGATACCAAAAGGCTCCTGTGAGCACGGTCTTGATCAAAC 
TCGCCCTTCTGTCTGGCCAGCTGCCCACGACCTACGGTGTATGTCCAGTGGCCTCCAGCAGAT 
CATGATGACATCATGGACCCAATAGCTCATTCACTGCCTTGATTCCTTTTGCCAACAATTTTA 
CCAGCAGTTATACCTAACATATTATGCAATTTTCTCTTGGTGCTACCTGATGGAATTCCTGCA 
CTTAAAGTTCTGGCTGACTAAACAAGATATATCATTTTCTTTCTTCTCTTTTTGTTTGGAAAA 
TCAAGTACTTCTTTGAATGATGATCTCTTTCTTGCAAATGATATTGTCAGTAAAATAATCACG 
TTAGACTTCAGACCTCTGGGGATTCTTTCCGTGTCCTGAAAGAGAATTTTTAAATTATTTAAT 
AAGAAAAAATTTATATTAATGATTGTTTCCTTTAGTAATTTATTGTTCTGTACTGATATTTAA 
ATAAAGAGTTCTATTTCCCAAAAAAAAAAAAAAAAAA 
FIGURE 436

MAAAAATKILLCLPLLLLLSGWSRAGRADPHSLCYDITVIPKFRPGPRWCAVQGQVDEKTFLH
YDCGNKTVTPVSPLGKKLNVTTAWKAQNPVLREVVDILTEQLRDIQLENYTPKEPLTLQARMS
CEQKAEGHSSGSWQFSFDGQIFLLFDSEKRMWTTVHPGARKMKEKWENDKVVAMSFHYFSMGD
CIGWLEDFLMGMDSTLEPSAGAPLAMSSGTTQLRATATTLILCCLLIILPCFILPGI 
FIGURE 437

GTTCTCCTTTCCGAGCCAAAATCCCAGGCGATGGTGAATTATGAACGTGCCACACCATGAAGCTCTTGTGGCAGG 
TAACTGTGCACCACCACACCTGGAATGCCATCCTGCTCCCGTTCGTCTACCTCACGGCGCAAGTGTGGATTCTGT 
GTGCAGCCATCGCTGCTGCCGCCTCAGCCGGGCCCCAGAACTGCCCCTCCGTTTGCTCGTGCAGTAACCAGTTCA 
GCAAGGTGGTGTGCACGCGCCGGGGCCTCTCCGAGGTCCCGCAGGGTATTCCCTCGAACACCCGGTACCTCAACC 
TCATGGAGAACAACATCCAGATGATCCAGGCCGACACCTTCCGCCACCTCCACCACCTGGAGGTCCTGCAGTTGG 
GCAGGAACTCCATCCGGCAGATTGAGGTGGGGGCCTTCAACGGCCTGGCCAGCCTCAACACCCTGGAGCTGTTCG
ACAACTGGCTGACAGTCATCCCTAGCGGGGCCTTTGAATACCTGTCCAAGCTGCGGGAGCTCTGGCTTCGCAACA 
ACCCCATCGAAAGCATCCCCTCTTACGCCTTCAACCGGGTGCCCTCCCTCATGCGCCTGGACTTGGGGGAGCTCA 
AGAAGCTGGAGTATATCTCTGAGGGAGCTTTTGAGGGGCTGTTCAACCTCAAGTATCTGAACTTGGGCATGTGCA 
ACATTAAAGACATGCCCAATCTCACCCCCCTGGTGGGGCTGGAGGAGCTGGAGATGTCAGGGAACCACTTCCCTG 
AGATCAGGCCTGGCTCCTTCCATGGCCTGAGCTCCCTCAAGAAGCTCTGGGTCATGAACTCACAGGTCAGCCTGA 
TTGAGCGGAATGCTTTTGACGGGCTGGCTTCACTTGTGGAACTCAACTTGGCCCACAATAACCTCTCTTCTTTGC 
CCCATGACCTCTTTACCCCGCTGAGGTACCTGGTGGAGTTGCATCTACACCACAACCCTTGGAACTGTGATTGTG
ACATTCTGTGGCTAGCCTGGTGGCTTCGAGAGTATATACCCACCAATTCCACCTGCTGTGGCCGCTGTCATGCTC 
CCATGCACATGCGAGGCCGCTACCTCGTGGAGGTGGACCAGGCCTCCTTCCAGTGCTCTGCCCCCTTCATCATGG 
ACGCACCTCGAGACCTCAACATTTCTGAGGGTCGGATGGCAGAACTTAAGTGTCGGACTCCCCCTATGTCCTCCG 
TGAAGTGGTTGCTGCCCAATGGGACAGTGCTCAGCCACGCCTCCCGCCACCCAAGGATCTCTGTCCTCAACGACG 
GCACCTTGAACTTTTCCCACGTGCTGCTTTCAGACACTGGGGTGTACACATGCATGGTGACCAATGTTGCAGGCA 
ACTCCAACGCCTCGGCCTACCTCAATGTGAGCACGGCTGAGCTTAACACCTCCAACTACAGCTTCTTCACCACAG 
TAACAGTGGAGACCACGGAGATCTCGCCTGAGGACACAACGCGAAAGTACAAGCCTGTTCCTACCACGTCCACTG 
GTTACCAGCCGGCATATACCACCTCTACCACGGTGCTCATTCAGACTACCCGTGTGCCCAAGCAGGTGGCAGTAC 
CCGCGACAGACACCACTGACAAGATGCAGACCAGCCTGGATGAAGTCATGAAGACCACCAAGATCATCATTGGCT 
GCTTTGTGGCAGTGACTCTGCTAGCTGCCGCCATGTTGATTGTCTTCTATAAACTTCGTAAGCGGCACCAGCAGC
GGAGTACAGTCACAGCCGCCCGGACTGTTGAGATAATCCAGGTGGACGAAGACATCCCAGCAGCAACATCCGCAG 
CAGCAACAGCAGCTCCGTCCGGTGTATCAGGTGAGGGGGCAGTAGTGCTGCCCACAATTCATGACCATATTAACT 
ACAACACCTACAAACCAGCACATGGGGCCCACTGGACAGAAAACAGCCTGGGGAACTCTCTGCACCCCACAGTCA 
CCACTATCTCTGAACCTTATATAATTCAGACCCATACCAAGGACAAGGTACAGGAAACTCAAATATGACTCCCCT
CCCCCAAAAAACTTATAAAATGCAATAGAATGCACACAAAGACAGCAACTTTTGTACAGAGTGGGGAGAGACTTT
TTCTTGTATATGCTTATATATTAAGTCTATGGGCTGGTTAAAAAAAACAGATTATATTAAAATTTAAAGACAAAA 
AGTCAAAACA 
FIGURE 438

MKLLWQVTVHHHTWNAILLPFVYLTAQVWILCAAIAAAASAGPQNCPSVCSCSNQFSKVVCTR 
RGLSEVPQGIPSNTRYLNLMENNIQMIQADTFRHLHHLEVLQLGRNSIRQIEVGAFNGLASLN 
TLELFDNWLTVIPSGAFEYLSKLRELWLRNNPIESIPSYAFNRVPSLMRLDLGELKKLEYISE 
GAFEGLFNLKYLNLGMCNIKDMPNLTPLVGLEELEMSGNHFPEIRPGSFHGLSSLKKLWVMNS 
QVSLIERNAFDGLASLVELNLAHNNLSSLPHDLFTPLRYLVELHLHHNPWNCDCDILWLAWWL 
REYIPTNSTCCGRCHAPMHMRGRYLVEVDQASFQCSAPFIMDAPRDLNISEGRMAELKCRTPP 
MSSVKWLLPNGTVLSHASRHPRISVLNDGTLNFSHVLLSDTGVYTCMVTNVAGNSNASAYLNV 
STAELNTSNYSFFTTVTVETTEISPEDTTRKYKPVPTTSTGYQPAYTTSTTVLIQTTRVPKQV 
AVPATDTTDKMQTSLDEVMKTTKIIIGCFVAVTLLAAAMLIVFYKLRKRHQQRSTVTAARTVE 
IIQVDEDIPAATSAAATAAPSGVSGEGAVVLPTIHDHINYNTYKPAHGAHWTENSLGNSLHPT 
VTTISEPYIIQTHTKDKVQETQI 
FIGURE 439

GTCGAATCCAAATCACTCATTGTGAAAGCTGAGCTCACAGCCGAATAAGCCACCATCAGGCTG 
TCAGTGTGTCTCCTGATGGTCTCGCTGGCCCTTTGCTGCTACCAGGCCCATGCTCTTGTCTGC 
CCAGCTGTTGCTTCTGAGATCACAGTCTTCTTATTCTTAAGTGACGCTGCGGTAAACCTCCAA 
GTTGCCAAACTTAATCCACCTCCAGAAGCTCTTGCAGCCAAGTTGGAAGTGAAGCACTGCACC 
GATCAGATATCTTTTAAGAAACGACTCTCATTGAAAAAGTCCTGGTGGAAATAGTGAAAAAAT 
GTGGTGTGTGACATGTAAAAATGCTCAACCTGGTTTCCAAAGTCTTTCT^ACGACACCCTGATC 
TTCACTAAAAATTGTAAAGGTTTCAACACGTTGCTTTAATAAATCACTTGCCCTGC 
FIGURE 440

MRLSVCLLMVSLALCCYQAHALVCPAVASEITVFLFLSDAAVNLQVAKLNPPPEALAAKLEVK 
HCTDQISFKKRLSLKKSWWK
FIGURE 441

GAACATTTTTAGTTCCCAAGGAATGTACATCAGCCCCACGGAAGCTAGGCCACCTCTGGGATG
GGGTTGCTGGTTTAAAACAAACGCCAGTCATCCTATATAAGGACCTGACAGCCACCAGGCACC 
ACCTCCGCCAGGAACTGCAGGCCCACCTGTCTGCAACCCAGCTGAGGCCATGCCCTCCCCAGG
GACCGTCTGCAGCCTCCTGCTCCTCGGCATGCTCTGGCTGGACTTGGCCATGGCAGGCTCCAG 
CTTCCTGAGCCCTGAACACCAGAGAGTCCAGCAGAGAAAGGAGTCGAAGAAGCCACCAGCCAA 
GCTGCAGCCCCGAGCTCTAGCAGGCTGGCTCCGCCCGGAAGATGGAGGTCAAGCAGAAGGGGC 
AGAGGATGAACTGGAAGTCCGGTTCAACGCCCCCTTTGATGTTGGAATCAAGCTGTCAGGGGT 
TCAGTACCAGCAGCACAGCCAGGCCCTGGGGAAGTTTCTTCAGGACATCCTCTGGGAAGAGGC 
CAAAGAGGCCCCAGCCGACAAGT^TCGCCCACAAGCCTTACTCACCTCTCTCTAAGTTTAGA 
AGCGCTCATCTGGCTTTTCGCTTGCTTCTGCAGCAACTCCCACGACTGTTGTACAAGCTCAGG 
AGGCGAATAAATGTTCAAACTGTA 
FIGURE 442

MPSPGTVCSLLLLGMLWLDLAMAGSSFLSPEHQRVQQRKESKKPPAKLQPRALAGWLRPEDGG 
QAEGAEDELEVRFNAPFDVGIKLSGVQYQQHSQALGKFLQDILWEEAKEAPADK
FIGURE 443

CGGCCACAGCTGGCATGCTCTGCCTGATCGCCATCCTGCTGTATGTCCTCGTCCAGTACCTCG 
TGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAATATGAACACGTGGCTGCTGTTC 
CTCCCCCTGTTCCCGGTGCAGGTGCAGACCCTGATAGTCGTGATCATCGGGATGCTCGTGCTC 
CTGCTGGACTTTCTTGGCTTGGTGCACCTGGGCCAGCTGCTCATCTTGCACATCTACCTGAGT 
ATGTCCCCCACCCTAAGCCCCCGATCCCCCCAAGGCTGGGTGGTCAGAGCTGCTCATCTTACA 
CCTCTACTTGAGTATGTCCCTAACCCTGAGCCCCCCACGCCTGGGGCCAGAGTCTTTGTCCCC 
CGTGTGCGCATGTGTTCAGGGTCAGCCTCTCCCAGAAGTGAGATCATGGACAAAAAGGGCAAA 
TCACAGGAAGAAATTAAATCCATGAGGACCCAGCAGGCCCAGCAAGAAGCTGAACTCACGCCG 
AGACCTGCAGGAGTGGTGCCAGGTGCTTGAAGTAACAAGTTTAAAATGTTCAGAGACAATGGA 
ATGGAATCTATTAGGCAAGAACAGGACATTATGAAATAAGGACAGGTGGACTTCCAAAAACAC 
AAGTAGAAATTCTAACAATGAAATATATTACAGGCAGGTCACCCACTAACCAAACAACTGAAG 
CGAGAGCTGTGGTCTTGCTTGGTCTCACAGTGGGCACAGCGGTAGGCGGTCAGTCATGTTGCT 
GAACGACGGAGGGTAAACTCCCCAGCCCCAAGAAAACCTGTGTTGGAAGTAACAACAACCTCC 
CTGCTCCTGGCACCAGCCGTTTTGGTCATGGTGGGCCAGCTGCAAAGCGTCTTCCATTCTCTG 
GGCAGTGGTGGCCCCGAGGCTGTGGCCTCTCAGGGGGTTTCTGTGGACACGGGCAGCAGAGTG 
TGTCCAGGCCAGCCCCCAAGAATGCCCTGCTCCTGACAGCTTGGCCAACCCCTGGTCAGGGCA 
GAGGGAGTTGGGTGGGTCAGGCTCTGGGCTCACCTCCATCTCCAGAGCATCCCCTGCCTGCAG 
TTGTGGCAAGAACGCCCAGCTCAGAATGAACACACCCCACCAAGAGCCTCCTTGTTCATAACC 
ACAGGTTACCCTACAAACCACTGTCCCCACACAACCCTGGGGATGTTTTAAAACACACACCTC 
TAACGCATATCTTACAGTCACTGTTGTCTTGCCTGAGGGTTGAATTTTTTTTAATGAAAGTGC 
AATGAAAATCACTGGATTAAATCCTACGGACACAGAGCTGAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAA 
FIGURE 444

MNTWLLFLPLFPVQVQTLIVVIIGMLVLLLDFLGLVHLGQLLIFHIYLSMSPTLSPRSPQGWV
VRAAHLTPLLEYVPNPEPPTPGARVFVPRVRMCSGSASPRSEIMDKKGKSQEEIKSMRTQQAQ
QEAELTPRPAGVVPGA
FIGURE 445

AGGCGGGCAGCAGCTGCAGGCTGACCTTGCAGCTTGGCGGAATGGACTGGCCTCACAACCTGC 
TGTTTCTTCTTACCATTTCCATCTTCCTGGGGCTGGGCCAGCCCAGGAGCCCCAAAAGCAAGA 
GGAAGGGGCAAGGGCGGCCTGGGCCCCTGGCCCCTGGCCCTCACCAGGTGCCACTGGACCTGG 
TGTCACGGATGAAACCGTATGCCCGCATGGAGGAGTATGAGAGGAACATCGAGGAGATGGTGG 
CCCAGCTGAGGAACAGCTCAGAGCTGGCCCAGAGAAAGTGTGAGGTCAACTTGCAGCTGTGGA 
TGTCCAACAAGAGGAGCCTGTCTCCCTGGGGCTACAGCATCAACCACGACCCCAGCCGTATCC 
CCGTGGACCTGCCGGAGGCACGGTGCCTGTGTCTGGGCTGTGTGAACCCCTTCACCATGCAGG 
AGGACCGCAGCATGGTGAGCGTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCGCCTCTGCC 
CGCCACCGCCCCGCACAGGGCCTTGCCGCCAGCGCGCAGTCATGGAGACCATCGCTGTGGGCT 
GCACCTGCATCTTCTGAATCACCTGGCCCAGAAGCCAGGCCAGCAGCCCGAGACCATCCTCCT 
TGCACCTTTGTGCCAAGAAAGGCCTATGAAAAGTAAACACTGACTTTTGAAAGCAAG 
FIGURE 446

MDWPHNLLFLLTISIFLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEEYE 
RNIEEMVAQLRNSSELAQRKCEVNLQLWMSNKRSLSPWGYSINHDPSRIPVDLPEARCLCLGC 
VNPFTMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCIF 

Important features: 
Signal peptide:

amino acids 1-20 

N-glycosylation site. 

amino acids 75-78 



Homologous region to IL-17 

amino acids 96-180
FIGURE 447

GGAGTGCAGATGGCATCCTTCGGTTCTTCCAGACAAGCTGCAAGACGCTGACCATGGCCAAGA 
TGGAGCTCTCGAAGGCCTTCTCTGGCCAGCGGACACTCCTATCTGCCATCCTCAGCATGCTAT 
CACTCAGCTTCTCCACAACATCCCTGCTCAGCAACTACTGGTTTGTGGGCACACAGAAGGTGC 
CCAAGCCCCTGTGCGAGAAAGGTCTGGCAGCCAAGTGCTTTGACATGCCAGTGTCCCTGGATG 
GAGATACCAACACATCCACCCAGGAGGTGGTACAATACAACTGGGAGACTGGGGATGACCGGT 
TCTCCTTCCGGAGCTTCCGGAGTGGCATGTGGCTATCCTGTGAGGAAACTGTGGAAGAACCAG 
GGGAGAGGTGCCGAAGTTTCATTGAACTTACACCACCAGCCAAGAGAGGTGAGAAAGGACTAC 
TGGAATTTGCCACGTTGCAAGGCCCATGTCACCCCACTCTCCGATTTGGAGGGAAGCGGTTGA 
TGGAGAAGGCTTCCCTCCCCTCCCCTCCCTTGGGGCTTTGTGGCAAAAATCCTATGGTTATCC 
CTGGGAACGCAGATCACCTACATCGGACTTCAATTCATCAGCTTCCTCCTGCTACTAACAGAC 
TTGCTACTCACTGGGAACCCTGCCTGTGGGCTCAAACTGAGCGCCTTTGCTGCTGTTTCCTCT 
GTCCTGTCAGGTCTCCTGGGGATGGTGGCCCACATGATGTATTCACAAGTCTTCCAAGCGACT 
GTCAACTTGGGTCCAGAAGACTGGAGACCACATGTTTGGAATTATGGCTGGGCCTTCTACATG 
GCCTGGCTCTCCTTCACCTGCTGCATGGCGTCGGCTGTCACCACCTTCAACACGTACACCAGG 
ATGGTGCTGGAGTTCAAGTGCAAGCATAGTAAGAGCTTCAAGGAAAACCCGAACTGCCTACCA 
CATCACCATCAGTGTTTCCCTCGGCGGCTGTCAAGTGCAGCCCCCACCGTGGGTCCTTTGACC 
AGCTACCACCAGTATCATAATCAGCCCATCCACTCTGTCTCTGAGGGAGTCGACTTCTACTCC 
GAGCTGCGGAACAAGGGATTTCAAAGAGGGGCCAGCCAGGAGCTGAAAGAAGCAGTTAGGTCA 
TCTGTAGAGGAAGAGCAGTGTTAGGAGTTAAGCGGGTTTGGGGAGTAGGCTTGAGCCCTACCT 
TACACGTCTGCTGATTATCAACATGTGCTTAAGCCAACATCCGTCTCTTGAGCATGGTTTTTA 
GAGGCTACGAATAAGGCTATGAATAAGGGTTATCTTTAAGTCCTAAGGGATTCCTGGGTGCCA 
CTGCTCTCTTTTCCTCTACAGCTCCATCTTGTTTCACCCACCCCACATCTCACACATCCAGAA 
TTCCCTTCTTTACTGATAGTTTCTGTGCCAGGTTCTGGGCTAAACCATGGAGATAAAAAGAAG 
AGTAAAATACACTTCCCGACCTTAAGGATCTGAAA
FIGURE 448

MAKMELSKAFSGQRTLLSAILSMLSLSFSTTSLLSNYWFVGTQKVPKPLCEKGLAAKCFDMPV 
SLDGDTNTSTQEVVQYNWETGDDRFSFRSFRSGMWLSCEETVEEPGERCRSFIELTPPAKRGE 
KGLLEFATLQGPCHPTLRFGGKRLMEKASLPSPPLGLCGKNPMVIPGNADHLHRTSIHQLPPA 
TNRLATHWEPCLWAQTERLCCCFLCPVRSPGDGGPHDVFTSLPSDCQLGSRRLETTCLELWLG 
LLHGLALLHLLHGVGCHHLQHVHQDGAGVQVQA 
FIGURE 449

CCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCC 
ACGCGTCCGGTGCAAGCTCGCGCCGCACACTGCCTGGTGGAGGGAAGGAGCCCGGGCGCCTCTCGCCGCTCCCCG 
CGCCGCCGTCCGCACCTCCCCACCGCCCGCCGCCCGCCGCCCGCCGCCCGCAAAGCATGAGTGAGCCCGCTCTCT 
GCAGCTGCCCGGGGCGCGAATGGCAGGCTGTTTCCGCGGAGTAAAAGGTGGCGCCGGTCAGTGGTCGTTTCCAAT 
GACGGACATTAACCAGACTGTCAGATCCTGGGGAGTCGCGAGCCCCGAGTTTGGAGTTTTTTCCCCCCACAACGT
CACAGTCCGAACTGCAGAGGGAAAGGAAGGCGGCAGGAAGGCGAAGCTCGGGCTCCGGCACGTAGTTGGGAAACT 
TGCGGGTCCTAGAAGTCGCCTCCCCGCCTTGCCGGCCGCCCTTGCAGCCCCGAGCCGAGCAGCAAAGTGAGACAT 
TGTGCGCCTGCCAGATCCGCCGGCCGCGGACCGGGGCTGCCTCGGAAACACAGAGGGGTCTTCTCTCGCCCTGCA 
TATAATTAGCCTGCACACAAAGGGAGCAGCTGAATGGAGGTTGTCACTCTCTGGAAAAGGATTTCTGACCGAGCG 
CTTCCAATGGACATTCTCCAGTCTCTCTGGAAAGATTCTCGCTAATGGATTTCCTGCTGCTCGGTCTCTGTCTAT 
ACTGGCTGCTGAGGAGGCCCTCGGGGGTGGTCTTGTGTCTGCTGGGGGCCTGCTTTCAGATGCTGCCCGCCGCCC 
CCAGCGGGTGCCCGCAGCTGTGCCGGTGCGAGGGGCGGCTGCTGTACTGCGAGGCGCTCAACCTCACCGAGGCGC 
CCCACAACCTGTCCGGCCTGCTGGGCTTGTCCCTGCGCTACAACAGCCTCTCGGAGCTGCGCGCCGGCCAGTTCA 
CGGGGTTAATGCAGCTCACGTGGCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGGGGACGCCTTTCAGA 
AACTGCGCCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACACCACCTTCCGGCCCA 
TGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGCGCCCGACCTCTTCCACGGGCTGC 
GGAAGCTCACCACGCTGCATATGCGGGCCAACGCCATCCAGTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCA 
GCCTCAAGTTTCTCGACATCGGATACAATCAGCTCAAGAGTCTGGCGCGCAACTCTTTCGCCGGCTTGTTTAAGC 
TCACCGAGCTGCACCTCGAGCACAACGACTTGGTCAAGGTGAACTTCGCCCACTTCCCGCGCCTCATCTCCCTGC 
ACTCGCTCTGCCTGCGGAGGAACAAGGTGGCCATTGTGGTCAGCTCGCTGGACTGGGTTTGGAACCTGGAGAAAA 
TGGACTTGTCGGGCAACGAGATCGAGTACATGGAGCCCCATGTGTTCGAGACCGTGCCGCACCTGCAGTCCCTGC 
AGCTGGACTCCAACCGCCTCACCTACATCGAGCCCCGGATCCTCAACTCTTGGAAGTCCCTGACAAGCATCACCC 
TGGCCGGGAACCTGTGGGATTGCGGGCGCAACGTGTGTGCCCTAGCCTCGTGGCTCAGCAACTTCCAGGGGCGCT 
ACGATGGCAACTTGCAGTGCGCCAGCCCGGAGTACGCACAGGGCGAGGACGTCCTGGACGCCGTGTACGCCTTCC 
ACCTGTGCGAGGATGGGGCCGAGCCCACCAGCGGCCACCTGCTCTCGGCCGTCACCAACCGCAGTGATCTGGGGC 
CCCCTGCCAGCTCGGCCACCACGCTCGCGGACGGCGGGGAGGGGCAGCACGACGGCACATTCGAGCCTGCCACCG 
TGGCTCTTCCAGGCGGCGAGCACGCCGAGAACGCCGTGCAGATCCACAAGGTGGTCACGGGCACCATGGCCCTCA 
TCTTCTCCTTCCTCATCGTGGTCCTGGTGCTCTACGTGTCCTGGAAGTGTTTCCCAGCCAGCCTCAGGCAGCTCA 
GACAGTGCTTTGTCACGCAGCGCAGGAAGCAAAAGCAGAAACAGACCATGCATCAGATGGCTGCCATGTCTGCCC 
AGGAATACTACGTTGATTACAAACCGAACCACATTGAGGGAGCCCTGGTGATCATCAACGAGTATGGCTCGTGTA 
CCTGCCACCAGCAGCCCGCGAGGGAATGCGAGGTGTGATTGTCCCAGTGGCTCTCAACCCATGCGCTACCAAATA 
CGCCTGGGCAGCCGGGACGGGCCGGCGGGCACCAGGCTGGGGTCTCCTTGTCTGTGCTCTGATATGCTCCTTGAC 
TGAAACTTTAAGGGGATCTCTCCCAGAGACTTGACATTTTAGCTTTATTGTGTCTTAAAAACAAAAGCGAATTAA 
AACACAACAAAAAACCCCACCCCACAACCTTCAGGACAGTCTATCTTAAATTTCATATGAGAACTCCTTCCTCCC 
TTTGAAGATCTGTCCATATTCAGGAATCTGAGAGTGTAAAAAAGGTGGCCATAAGACAGAGAGAGAATAATCGTG 
CTTTGTTTTATGCTACTCCTCCCACCCTGCCCATGATTAAACATCATGTATGTAGAAGATCTTAAGTCCATACGC 
ATTTCATGAAGAACCATTGGAAAGAGGAATCTGCAATCTGGGAGCTTTVAGAGCAAATGATGACCATAGAAAGCTA 
TGTTCTTACTTTGTGTGTGTGTCTGTATGTTTCTGCGTTGTGTGTCTTTGTAGGCAAGCAAACGTTGTCTACACA 
AACGGGAATTTAGCTCACATCATTTCATGCCCCTGTGCCTCTAGCTCTGGAGATTGGTGGGGGGAGGTGGGGGGA 
AACGGCAGGAATAAGGGAAAGTGGTAGTTTTAACTAAGGTTTTGTAACACTTGAAATCTTTTCTTTCTCAAATTA 
ATTATCTTTAAGCTTCAAGAAACTTGCTCTGACCCCTCTAAGCAAACTACTAAGCATTTAAAAGAGAATCTAATT 
TTTAAAGGTGTAGCACCTTTTTTTTTATTCTTCCCACAGAGGGTGCTAATCTCATTATGCTGTGCTATCTGAAAA
GAACTTAAGGCCACAATTCAGGTCTCGTCCTGGGCATTGTGATGGATTGACCCTCCATTTGCAGTACCTTCCCAG
CTGATTAAAGTTCAGCAGTGGTATTGAGGTTTTTCGAATATTTATATAGAAAAAAAGTCTTTTCACATGACAAAT 
GACACTCTCACACCAGTCTTAGCCCTAGTAGTTTTTTAGGTTGGACCAGAGGAAGCAGGTTAAATGAGACCTGTC 
CTCTGCTGCACTCAGAAAAAATAGGCAGTCCCTGATGCTCAGATCTTAGCCTTGATATTAATAGTTGAGACCACC 
TACCCACAATGCAGCCTATACTCCCAAGACTACAAAGTTACCATCGCAAAGGAAAGGTTATTCCAGTAAAAGGAA 
ATAGTTTTCTCAACCATTTAAAAATATTCTTCTGAACTCATCAAAGTAGAAGAGCCCCCAACCTTTTCTCTCTGC 
CTTCAAGAAGGCAGACATTTGGTATGATTTAGCATCAACAACACATTTATGAGTATATGTAAGTAATCAGAGGGG 
CAAATGCCACTTGTTATTCCTCCCAAGTTTTCCAAGCAAGTACACACAGATCTCTGGTAGGATTAGGGGCCACTT 
GTGTTTCCGGCTTATTTTAGTCGACTTGTCAGCAAGTTTGATGCCTAGTCTATCTGACATGGCCCAGTAGAACAG 
GGCATTGATGGATCACATGAGATGGTAGAAGGAACATCATCACATACCCCTCTCACAGAGAAAATTATCAAAGAA 
CCAGAAATTATATCTGTTTTGGAGCAAGAGTGTCATAATGTTTCAGGGTAGTCAAAATAAACATAAATTATCTCC 
TCTAGATGAGTGGCGATGTTGGCTGATTTGGGTCTGCCATTGACAGAATGTCAAATAAAAAGGAATTAGCTAGAA 
TATGACCATTAAATGTGCTTCTGAAATATATTTTGAGATAGGTTTAGAATGTCA 
FIGURE 450

MDFLLLGLCLYWLLRRPSGVVLCLLGACFQMLPAAPSGCPQLCRCEGRLLYCEALNLTEAPHN
LSGLLGLSLRYNSLSELRAGQFTGLMQLTWLYLDHNHICSVQGDAFQKLRRVKELTLSSNQIT 
QLPNTTFRPMPNLRSVDLSYNKLQALAPDLFHGLRKLTTLHMRANAIQFVPVRIFQDCRSLKF 
LDIGYNQLKSLARNSFAGLFKLTELHLEHNDLVKVNFAHFPRLISLHSLCLRRNKVAIVVSSL
DWVWNLEKMDLSGNEIEYMEPHVFETVPHLQSLQLDSNRLTYIEPRILNSWKSLTSITLAGNL 
WDCGRNVCALASWLSNFQGRYDGNLQCASPEYAQGEDVLDAVYAFHLCEDGAEPTSGHLLSAV 
TNRSDLGPPASSATTLADGGEGQHDGTFEPATVALPGGEHAENAVQIHKVVTGTMALIFSFLI
VVLVLYVSWKCFPASLRQLRQCFVTQRRKQKQKQTMHQMAAMSAQEYYVDYKPNHIEGALVII
NEYGSCTCHQQPARECEV 
FIGURE 451

TTGAGCGCAGGTGAGCTCCTGCGCGTTCCGGGGGCGTTCCTCCAGTCACCCTCCCGCCGTTAC 

CCGCGGCGCGCCCGAGGGAGTCTCCTCCAGACCCTCCCTCCCGTTGCTCCAAACTAATACGGA

CTGAACGGATCGCTGCGAGGGTGGGAGAGAAAATTAGGGGGAGAAAGGACAGAGAGAGCAACT 

ACCATCCATAGCCAGATAGATTATCTTACACTGAACTGATCAAGTACTTTGAAAATGACTTCG 

AAATTTATCTTGGTGTCCTTCATACTTGCTGCACTGAGTCTTTCAACCACCTTTTCTCTCCAA 

CTAGACCAGCAAAAGGTTCTACTAGTTTCTTTTGATGGATTCCGTTGGGATTACTTATATAAA 

GTTCCAACGCCCCATTTTCATTATATTATGAAATATGGTGTTCACGTGAAGCAAGTTACTAAT 

GTTTTTATTACAAAAACCTACCCTAACCATTATACTTTGGTAACTGGCCTCTTTGCAGAGAAT

CATGGGATTGTTGCAAATGATATGTTTGATCCTATTCGGAACAAATCTTTCTCCTTGGATCAC 

ATGAATATTTATGATTCCAAGTTTTGGGAAGAAGCGACACCAATATGGATCACAAACCAGAGG

GCAGGACATACTAGTGGTGCAGCCATGTGGCCCGGAACAGATGTAAAAATACATAAGCGCTTT 

CCTACTCATTACATGCCTTACAATGAGTCAGTTTCATTTGAAGATAGAGTTGCCAAAATTGTT 

GAATGGTTTACGTCAAAAGAGCCCATAAATCTTGGTCTTCTCTATTGGGAAGACCCTGATGAC 

ATGGGCCACCATTTGGGACCTGACAGTCCGCTCATGGGGCCTGTCATTTCAGATATTGACAAG 

AAGTTAGGATATCTCATACAAATGCTGAAAAAGGCAAAGTTGTGGAACACTCTGAACCTAATC 

ATCACAAGTGATCATGGAATGACGCAGTGCTCTGAGGAAAGGTTAATAGAACTTGACCAGTAC 

CTGGATAAAGACCACTATACCCTGATTGATCAATCTCCAGTAGCAGCCATCTTGCCAAAAGAA 

GGTAAATTTGATGAAGTCTATGAAGCACTAACTCACGCTCATCCTAATCTTACTGTTTACAAA 

AAAGAAGACGTTCCAGAAAGGTGGCATTACAAATACAACAGTCGAATTCAACCAATCATAGCA

GTGGCTGATGAAGGGTGGCACATTTTACAGAATAAGTCAGATGACTTTCTGTTAGGCAACCAC 

GGTTACGATAATGCGTTAGCAGATATGCATCCAATATTTTTAGCCCATGGTCCTGCCTTCAGA 

AAGAATTTCTCAAAAGAAGCCATGAACTCCACAGATTTGTACCCACTACTATGCCACCTCCTC 

AATATCACTGCCATGCCACACAATGGATCATTCTGGAATGTCCAGGATCTGCTCAATTCAGCA 

ATGCCAAGGGTGGTCCCTTATACACAGAGTACTATACTCCTCCCTGGTAGTGTTAAACCAGCA 

GAATATGACCAAGAGGGGTCATACGCTTATTTCATAGGGGTCTCTCTTGGCAGCATTATAGTG 

ATTGTATTTTTTGTAATTTTCATTAAGCATTTAATTCACAGTCAAATACCTGCCTTACAAGAT 

ATGCATGCTGAAATAGCTCAACCATTATTACAAGCCTAATGTTACTTTGAAGTGGATTTGCAT 

ATTGAAGTGGAGATTCCATAATTATGTCAGTGTTTAAAGGTTTCAAATTCTGGGAAACCAGTT 

CCAAACATCTGCAGAAACCATTAAGCAGTTACATATTTAGGTATACACACACACACACACACA 

CACATACACACACACGGACCAAAATACTTACACCTGCAAAGGAATAAAGATGTGAGAGTATGT 

CTCCATTGTTCACTGTAGCATAGGGATAGATAAGATCCTGCTTTATTTGGACTTGGCGCAGAT 

AATGTATATATTTAGCAACTTTGCACTATGTAAAGTACCTTATATATTGCACTTTAAATTTCT 

CTCCTGATGGGTACTTTAATTTGAAATGCACTTTATGGACAGTTATGTCTTATAACTTGATTG 

AAAATGACAACTTTTTGCACCCATGTCACAGAATACTTGTTACGCATTGTTCAAACTGAAGGA 

AATTTCTAATAATCCCGAATAATGAACATAGAAATCTATCTCCATAAATTGAGAGAAGAAGAA 

GGTGATAAGTGTTGAAAATTAAATGTGATAACCTTTGAACCTTGAATTTTGGAGATGTATTCC 

CAACAGCAGAATGCAACTGTGGGCATTTCTTGTCTTATTTCTTTCCAGAGAACGTGGTTTTCA 

TTTATTTTTCCCTCAAAAGAGAGTCAAATACTGACAGATTCGTTCTAAATATATTGTTTCTGT 

CATAAAATTATTGTGATTTCCTGATGAGTCATATTACTGTGATTTTCATAATAATGAAGACAC 

CATGAATATACTTTTCTTCTATATAGTTCAGCAATGGCCTGAATAGAAGCAACCAGGCACCAT 

CTCAGCAATGTTTTCTCTTGTTTGTAATTATTTGCTCCTTTGAAAATTAAATCACTATTAATT 

ACATTAAAAATCAAATTGGATAAAAAAAAAAAAAAAAAAA 
FIGURE 452

MTSKFILVSFILAALSLSTTFSLQLDQQKVLLVSFDGFRWDYLYKVPTPHFHYIMKYGVHVKQ 
VTNVFITKTYPNHYTLVTGLFAENHGIVANDMFDPIRNKSFSLDHMNIYDSKFWEEATPIWIT
NQRAGHTSGAAMWPGTDVKIHKRFPTHYMPYNESVSFEDRVAKIVEWFTSKEPINLGLLYWED 
PDDMGHHLGPDSPLMGPVISDIDKKLGYLIQMLKKAKLWNTLNLIITSDHGMTQCSEERLIEL 
DQYLDKDHYTLIDQSPVAAILPKEGKFDEVYEALTHAHPNLTVYKKEDVPERWHYKYNSRIQP 
IIAVADEGWHILQNKSDDFLLGNHGYDNALADMHPIFLAHGPAFRKNFSKEAMNSTDLYPLLC 
HLLNITAMPHNGSFWNVQDLLNSAMPRVVPYTQSTILLPGSVKPAEYDQEGSYPYFIGVSLGS 
IIVIVFFVIFIKHLIHSQIPALQDMHAEIAQPLLQA 

Important features: 
Signal Peptide: 

amino acids 1-22 

Transmembrane Domain: 

amino acids 429-452 

N-glycosylation sites: 

amino acids 101-104, 158-161, 292-295, 329-332, 362-365, 369-372, 
382-385, 389-392 

Somatomedin B Domain: 

amino acids 69-85 



Sulfatase protein Region: 

amino acids 212-241
FIGURE 453

GGCCGCCTGGAATTGTGGGAGTTGTGTCTGCCACTCGGCTGCCGGAGGCCGAAGGTCCGTGAC 

TATGGCTCCCCAGAGCCTGCCTTCATCTAGGATGGCTCCTCTGGGCATGCTGCTTGGGCTGCT 

GATGGCCGCCTGCTTCACCTTCTGCCTCAGTCATCAGAACCTGAAGGAGTTTGCCCTGACCAA 

CCCAGAGAAGAGCAGCACCAAAGAAACGGAGAGAAAAGAAACCAAAGCCGAGGAGGAGCTGGA 

TGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCCCTTCAGCCAGGGCAGGC 

TGTCCCTGCAGGATCCCACGTACGGCTGAATCTTCAGACTGGGGAAAGAGAGGCAAAACTCCA 

ATATGAGGACAAGTTCCGAAATAATTTGAAAGGCAAAAGGCTGGATATCAACACCAACACCTA 

CACATCTCAGGATCTCAAGAGTGCACTGGCAAAATTCAAGGAGGGGGCAGAGATGGAGAGTTC 

AAAGGAAGACAAGGCAAGGCAGGCTGAGGTAAAGCGGCTCTTCCGCCCCATTGAGGAACTGAA 

GAAAGACTTTGATGAGCTGAATGTTGTCATTGAGACTGACATGCAGATCATGGTACGGCTGAT 

CAACAAGTTCAATAGTTCCAGCTCCAGTTTGGAAGAGAAGATTGCTGCGCTCTTTGATCTTGA 

ATATTATGTCCATCAGATGGACAATGCGCAGGACCTGCTTTCCTTTGGTGGTCTTCAAGTGGT 

GATCAATGGGCTGAACAGCACAGAGCCCCTCGTGAAGGAGTATGCTGCGTTTGTGCTGGGCGC 

TGCCTTTTCCAGCAACCCCAAGGTCCAGGTGGAGGCCATCGAAGGGGGAGCCCTGCAGAAGCT 

GCTGGTCATCCTGGCCACGGAGCAGCCGCTCACTGCAAAGAAGAAGGTCCTGTTTGCACTGTG 

CTCCCTGCTGCGCCACTTCCCCTATGCCCAGCGGCAGTTCCTGAAGCTCGGGGGGCTGCAGGT 

CCTGAGGACCCTGGTGCAGGAGAAGGGCACGGAGGTGCTCGCCGTGCGCGTGGTCACACTGCT 

CTACGACCTGGTCACGGAGAAGATGTTCGCCGAGGAGGAGGCTGAGCTGACCCAGGAGATGTC 

CCCAGAGAAGCTGCAGCAGTATCGCCAGGTACACCTCCTGCCAGGCCTGTGGGAACAGGGCTG 

GTGCGAGATCACGGCCCACCTCCTGGCGCTGCCCGAGCATGATGCCCGTGAGAAGGTGCTGCA 

GACACTGGGCGTCCTCCTGACCACCTGCCGGGACCGCTACCGTCAGGACCCCCAGCTCGGCAG 

GACACTGGCCAGCCTGCAGGCTGAGTACCAGGTGCTGGCCAGCCTGGAGCTGCAGGATGGTGA 

GGACGAGGGCTACTTCCAGGAGCTGCTGGGCTCTGTCAACAGCTTGCTGAAGGAGCTGAGATG 

AGGCCCCACACCAGGACTGGACTGGGATGCCGCTAGTGAGGCTGAGGGGTGCCAGCGTGGGTG 

GGCTTCTCAGGCAGGAGGACATCTTGGCAGTGCTGGCTTGGCCATTAAATGGAAACCTGAAGG 

CCAAAAAAAAAAAAAAAAAAAAA

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 454

MAPQSLPSSRMAPLGMLLGLLMAACFTFCLSHQNLKEFALTNPEKSSTKETERKETKAEEELD 
AEVLEVFHPTHEWQALQPGQAVPAGSHVRLNLQTGEREAKLQYEDKFRNNLKGKRLDINTNTY 
TSQDLKSALAKFKEGAEMESSKEDKARQAEVKRLFRPIEELKKDFDELNVVIETDMQIMVRLI 
NKFNSSSSSLEEKIAALFDLEYYVHQMDNAQDLLSFGGLQVVINGLNSTEPLVKEYAAFVLGA 
AFSSNPKVQVEAIEGGALQKLLVILATEQPLTAKKKVLFALCSLLRHFPYAQRQFLKLGGLQV 
LRTLVQEKGTEVLAVRVVTLLYDLVTEKMFAEEEAELTQEMSPEKLQQYRQVHLLPGLWEQGW
CEITAHLLALPEHDAREKVLQTLGVLLTTCRDRYRQDPQLGRTLASLQAEYQVLASLELQDGE 
DEGYFQELLGSVNSLLKELR 

Important features: 
Signal peptide: 

amino acids 1-29 

Hypothetical YJL126w/YLR351c/yhcX family protein. 

amino acids 364-373 

N-glycosylation site. 

amino acids 193-197, 236-240 

N-myristoylation site. 

amino acids 15-21, 19-25, 234-240, 251-257, 402-408, 451-457 

Homologous region SLS1 protein. 

amino acids 68-340



WO 01/40466 



PCT/US00/32678 



455/550 

FIGURE 455 

GCCCCAGGGAGCAGTGGGTGGTTATAACTCAGGCCCGGTGCCCAGAGCCCAGGAGGAGGCAGT 
GGCCAGGAAGGCACAGGCCTGAGAAGTCTGCGGCTGAGCTGGGAGCAAATCCCCCACCCCCTA 
CCTGGGGGACAGGGCAAGTGAGACCTGGTGAGGGTGGCTCAGCAGGCAGGGAAGGAGAGGTGT 
CTGTGCGTCCTGCACCCACATCTTTCTCTGTCCCCTCCTTGCCCTGTCTGGAGGCTGCTAGAC 
TCCTATCTTCTGAATTCTATAGTGCCTGGGTCTCAGCGCAGTGCCGATGGTGGCCCGTCCTTG 
TGGTTCCTCTCTACCTGGGGAAATAAGGTGCAGCGGCCATGGCTACAGCAAGACCCCCCTGGA 
TGTGGGTGCTCTGTGCTCTGATCACAGCCTTGCTTCTGGGGGTCACAGAGCATGTTCTCGCCA 
ACAATGATGTTTCCTGTGACCACCCCTCTAACACCGTGCCCTCTGGGAGCAACCAGGACCTGG 
GAGCTGGGGCCGGGGAAGACGCCCGGTCGGATGACAGCAGCAGCCGCATCATCAATGGATCCG 
ACTGCGATATGCACACCCAGCCGTGGCAGGCCGCGCTGTTGCTAAGGCCCAACCAGCTCTACT 
GCGGGGCGGTGTTGGTGCATCCACAGTGGCTGCTCACGGCCGCCCACTGCAGGAAGAAAGTTT 
TCAGAGTCCGTCTCGGCCACTACTCCCTGTCACCAGTTTATGAATCTGGGCAGCAGATGTTCC 
AGGGGGTCAAATCCATCCCCCACCCTGGCTACTCCCACCCTGGCCACTCTAACGACCTCATGC 
TCATCAAACTGAACAGAAGAATTCGTCCCACTAAAGATGTCAGACCCATCAACGTCTCCTCTC 
ATTGTCCCTCTGCTGGGACAAAGTGCTTGGTGTCTGGCTGGGGGACAACCAAGAGCCCCCAAG 
TGCACTTCCCTAAGGTCCTCCAGTGCTTGAATATCAGCGTGCTAAGTCAGAAAAGGTGCGAGG 
ATGCTTACCCGAGACAGATAGATGACACCATGTTCTGCGCCGGTGACAAAGCAGGTAGAGACT 
CCTGCCAGGGTGATTCTGGGGGGCCTGTGGTCTGCAATGGCTCCCTGCAGGGACTCGTGTCCT 
GGGGAGATTACCCTTGTGCCCGGCCCAACAGACCGGGTGTCTACACGAACCTCTGCAAGTTCA 
CCAAGTGGATCCAGGAAACCATCCAGGCCAACTCCTGAGTCATCCCAGGACTCAGCACACCGG 
CATCCCCACCTGCTGCAGGGACAGCCCTGACACTCCTTTCAGACCCTCATTCCTTCCCAGAGA 
TGTTGAGAATGTTCATCTCTCCAGCCCCTGACCCCATGTCTCCTGGACTCAGGGTCTGCTTCC 
CCCACATTGGGCTGACCGTGTCTCTCTAGTTGAACCCTGGGAACAATTTCCAAAACTGTCCAG 
GGCGGGGGTTGCGTCTCAATCTCCCTGGGGCACTTTCATCCTCAAGCTCAGGGCCCATCCCTT
CTCTGCAGCTCTGACCCAAATTTAGTCCCAGAAATAAACTGAGAAGTGGAAAAAAAAA 






456/550 
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MATARPPWMWVLCALITALLLGVTEHVLANNDVSCDHPSNTVPSGSNQDLGAGAGEDARSDDS 
SSRIINGSDCDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLSPV 
YESGQQMFQGVKSIPHPGYSHPGHSNDLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCLVSG 
WGTTKSPQVHFPKVLQCLNISVLSQKRCEDAYPRQIDDTMFCAGDKAGRDSCQGDSGGPVVCN 
GSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQETIQANS 
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FIGURE 457 

GCAGTCAGAGACTTCCCCTGCCCCTCGCTGGGAAAGAACATTAGGAATGCCTTTTAGTGCCTTGCTTCCTGAACT 
AGCTCACAGTAGCCCGGCGGCCCAGGGCAATCCGACCACATTTCACTCTCACCGCTGTAGGAATCCAGATGCAGG 
CCAAGTACAGCAGCACGAGGGACATGCTGGATGATGATGGGGACACCACCATGAGCCTGCATTCTCAAGCCTCTG 
CCACAACTCGGCATCCAGAGCCCCGGCGCACAGAGCACAGGGCTCCCTCTTCAACGTGGCGACCAGTGGCCCTGA 
CCCTGCTGACTTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTACTACCAGC 
TCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATACGTCCCAAGAGTTGCAATCTC 
TTCAAGTCCAGAATATAAAGCTTGCAGGAAGTCTGCAGCATGTGGCTGAAAAACTCTGTCGTGAGCTGTATAACA 
AAGCTGGAGCACACAGGTGCAGCCCTTGTACAGAACAATGGAAATGGCATGGAGACAATTGCTACCAGTTCTATA 
AAGACAGCAAAAGTTGGGAGGACTGTAAATATTTCTGCCTTAGTGAAAACTCTACCATGCTGAAGATAAACAAAC 
AAGAAGACCTGGAATTTGCCGCGTCTCAGAGCTACTCTGAGTTTTTCTACTCTTATTGGACAGGGCTTTTGCGCC 
CTGACAGTGGCAAGGCCTGGCTGTGGATGGATGGAACCCCTTTCACTTCTGAACTGTTCCATATTATAATAGATG 
TCACCAGCCCAAGAAGCAGAGACTGTGTGGCCATCCTCAATGGGATGATCTTCTCAAAGGACTGCAAAGAATTGA 
AGCGTTGTGTCTGTGAGAGAAGGGCAGGAATGGTGAAGCCAGAGAGCCTCCATGTCCCCCCTGAAACATTAGGCG 
AAGGTGACTGATTCGCCCTCTGCAACTACAAATAGCAGAGTGAGCCAGGCGGTGCCAAAGCAAGGGCTAGTTGAG 
ACATTGGGAAATGGAACATAATCAGGAAAGACTATCTCTCTGACTAGTACAAAATGGGTTCTCGTGTTTCCTGTT 
CAGGATCACCAGCATTTCTGAGCTTGGGTTTATGCACGTATTTAACAGTCACAAGAAGTCTTATTTACATGCCAC 
CAACCAACCTCAGAAACCCATAATGTCATCTGCCTTCTTGGCTTAGAGATAACTTTTAGCTCTCTTTCTTCTCAA 
TGTCTAATATCACCTCCCTGTTTTCATGTCTTCCTTACACTTGGTGGAATAAGAAACTTTTTGAAGTAGAGGAAA 
TACATTGAGGTAACATCCTTTTCTCTGACAGTCAAGTAGTCCATCAGAAATTGGCAGTCACTTCCCAGATTGTAC 
CAGCAAATACACAAGGAATTCTTTTTGTTTGTTTCAGTTCATACTAGTCCCTTCCCAATCCATCAGTAAAGACCC 
CATCTGCCTTGTCCATGCCGTTTCCCAACAGGGATGTCACTTGATATGAGAATCTCAAATCTCAATGCCTTATAA 
GCATTCCTTCCTGTGTCCATTAAGACTCTGATAATTGTCTCCCCTCCATAGGAATTTCTCCCAGGAAAGAAATAT 
ATCCCCATCTCCGTTTCATATCAGAACTACCGTCCCCGATATTCCCTTCAGAGAGATTAAAGACCAGAAAAAAGT 
GAGCCTCTTCATCTGCACCTGTTATAGTTTCAGTTCCTATTTTCTTCCATTGACCCATATTTATACCTTTCAGGT
ACTGAAGATTTAATAATAATAAATGTAAATACTGTGAAAAA 
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MQAKYSSTRDMLDDDGDTTMSLHSQASATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLVLLI 
GLAALGLLFFQYYQLSNTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLQHVAEKLCRELY 
NKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEFAASQSY 
SEFFYSYWTGLLRPDSGKAWLWMDGTPFTSELFHIIIDVTSPRSRDCVAILNGMIFSKDCKEL 
KRCVCERRAGMVKPESLHVPPETLGEGD 
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FIGURE 459 

GTTGATGGCAAACTTCCTCAAAGGAGGGGCAGAGCCTGCGCAGGGCAGGAGCAGCTGGCCCAC 
TGGCGGCCCGCAACACTCCGTCTCACCCTCTGGGCCCACTGCATCTAGAGGAGGGCCGTCTGT 
GAGGCCACTACCCCTCCAGCAACTGGGAGGTGGGACTGTCAGAAGCTGGCCCAGGGTGGTGGT 
CAGCTGGGTCAGGGACCTACGGCACCTGCTGGACCACCTCGCCTTCTCCATCGAAGCAGGGAA
GTGGGAGCCTCGAGCCCTCGGGTGGAAGCTGACCCCAAGCCACCCTTCACCTGGACAGGATGA 
GAGTGTCAGGTGTGCTTCGCCTCCTGGCCCTCATCTTTGCCATAGTCACGACATGGATGTTTA 
TTCGAAGCTACATGAGCTTCAGCATGAAAACCATCCGTCTGCCACGCTGGCTGGCAGCCTCGC 
CCACCAAGGAGATCCAGGTTAAAAAGTACAAGTGTGGCCTCATCAAGCCCTGCCCAGCCAACT 
ACTTTGCGTTTAAAATCTGCAGTGGGGCCGCCAACGTCGTGGGCCCTACTATGTGCTTTGAAG 
AGCGCATGATCATGAGTCCTGTGAAAAAGAATGTGGGCAGAGGCCTAAACATCGCCCTGGTGA 
ATGGAACCACGGGAGCTGTGCTGGGACAGAAGGCATTTGACATGTACTCTGGAGATGTTATGC 
ACCTAGTGAAATTCCTTAAAGAAATTCCGGGGGGTGCACTGGTGCTGGTGGCCTCCTACGACG 
ATCCAGGGACCAAAATGAACGATGAAAGCAGGAAACTCTTCTCTGACTTGGGGAGTTCCTACG 
CAAAACAACTGGGCTTCCGGGACAGCTGGGTCTTCATAGGAGCCAAAGACCTCAGGGGTAAAA 
GCCCCTTTGAGCAGTTCTTAAAGAACAGCCCAGACACAAACAAATACGAGGGATGGCCAGAGC 
TGCTGGAGATGGAGGGCTGCATGCCCCCGAAGCCATTTTAGGGTGGCTGTGGCTCTTCCTCAG 
CCAGGGGCCTGAAGAAGCTCCTGCCTGACTTAGGAGTCAGAGCCCGGCAGGGGCTGAGGAGGA 
GGAGCAGGGGGTGCTGCGTGGAAGGTGCTGCAGGTCCTTGCACGCTGTGTCGCGCCTCTCCTC
CTCGGAAACAGAACCCTCCCACAGCACATCCTACCCGGAAGACCAGCCTCAGAGGGTCCTTCT 
GGAACCAGCTGTCTGTGGAGAGAATGGGGTGCTTTCGTCAGGGACTGCTGACGGCTGGTCCTG 
AGGAAGGACAAACTGCCCAGACTTGAGCCCAATTAAATTTTATTTTTGCTGGTTTTGAAAAAA 
AAAAAAAAAAAAAA 
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FIGURE 460 

MRVSGVLRLLALIFAIVTTWMFIRSYMSFSMKTIRLPRWLAASPTKEIQVKKYKCGLIKPCPA
NYFAFKICSGAANVVGPTMCFEDRMIMSPVKNNVGRGLNIALVNGTTGAVLGQKAFDMYSGDV
MHLVKFLKEIPGGALVLVASYDDPGTKMNDESRKLFSDLGSSYAKQLGFRDSWVFIGAKDLRG 
KSPFEQFLKNSPDTNKYEGWPELLEMEGCMPPKPF 

Important features: 
Signal peptide: 

amino acids 1-15 

ATP/GTP-binding site motif A (P-loop).

amino acids 184-191 

N-glycosylation site. 

amino acids 107-110 
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FIGURE 461 

AAACTCAGCACTTGCCGGAGTGGCTCATTGTTAAGACAAAGGGTGTGCACTTCCTGGCCAGGA 
AACCTGAGCGGTGAGACTCCCAGCTGCCTACATCAAGGCCCCAGGACATGCAGAACCTTCCTC 
TAGAACCCGACCCAGCACCATGAGGTCCTGCCTGTGGAGATGCAGGCACCTGAGGCAAGGCGT 
CCAGTGGTCCTTGCTTCTGGCTGTCCTGGTCTTCTTTCTCTTCGCCTTGCCCTCTTTTATTAA 
GGAGCCTCAAACAAAGCCTTCCAGGCATCAACGCACAGAGAACATTAAAGAAAGGTCTCTACA 
GTCCCTGGCAAAGCCTAAGTCCCAGGCACCCACAAGGGCGAGGAGGACAACCATCTATGCAGA 
GCCAGCGCCAGAGAACAATGCCCTCAACACACAAACCCAGCCCAAGGCCCACACCACCGGAGA 
CAGAGGAAAGGAGGCCAACCAGGCACCGCCGGAGGAGCAGGACAAGGTGCCCCACACAGCACA 
GAGGGCAGCATGGAAGAGCCCAGAAAAAGAGAAAACCATGGTGAACACACTGTCACCCAGAGG 
GCAAGATGCAGGGATGGCCTCTGGCAGGACAGAGGCACAATCATGGAAGAGCCAGGACACAAA 
GACGACCCAAGGAAATGGGGGCCAGACCAGGAAGCTGACGGCCTCCAGGACGGTGTCAGAGAA 
GCACCAGGGCAAAGCGGCAACCACAGCCAAGACGCTCATTCCCAAAAGTCAGCACAGAATGCT 
GGCTCCCACAGGAGCAGTGTCAACAAGGACGAGACAGAAAGGAGTGACCACAGCAGTCATCCC 
ACCTAAGGAGAAGAAACCTCAGGCCACCCCACCCCCTGCCCCTTTCCAGAGCCCCACGACGCA 
GAGAAACCAAAGACTGAAGGCCGCCAACTTCAAATCTGAGCCTCGGTGGGATTTTGAGGAAAA 
ATACAGCTTCGAAATAGGAGGCCTTCAGACGACTTGCCCTGACTCTGTGAAGATCAAAGCCTC 
CAAGTCGCTGTGGCTCCAGAAACTCTTTCTGCCCAACCTCACTCTCTTCCTGGACTCCAGACA 
CTTCAACCAGAGTGAGTGGGACCGCCTGGAACACTTTGCACCACCCTTTGGCTTCATGGAGCT 
CAACTACTCCTTGGTGCAGAAGGTCGTGACACGCTTCCCTCCAGTGCCCCAGCAGCAGCTGCT 
CCTGGCCAGCCTCCCCGCTGGGAGCCTCCGGTGCATCACCTGTGCCGTGGTGGGCAACGGGGG 
CATCCTGAACAACTCCCACATGGGCCAGGAGATAGACAGTCACGACTACGTGTTCCGATTGAG 
CGGAGCTCTCATTAAAGGCTACGAACAGGATGTGGGGACTCGGACATCCTTCTACGGCTTTAC 
CGCCTTCTCCCTGACCCAGTCACTCCTTATATTGGGCAATCGGGGTTTCAAGAACGTGCCTCT 
TGGGAAGGACGTCCGCTACTTGCACTTCCTGGAAGGCACCCGGGACTATGAGTGGCTGGAAGC 
ACTGCTTATGAATCAGACGGTGATGTCAAAAAACCTTTTCTGGTTCAGGCACAGACCCCAGGA 
AGCTTTTCGGGAAGCCCTGCACATGGACAGGTACCTGTTGCTGCACCCAGACTTTCTCCGATA 
CATGAAGAACAGGTTTCTGAGGTCTAAGACGCTGGATGGTGCCCACTGGAGGATATACCGCCC 
CACCACTGGGGCCCTCCTGCTGCTCACTGCCCTTCAGCTCTGTGACCAGGTGAGTGCTTATGG 
CTTCATCACTGAGGGCCATGAGCGCTTTTCTGATCACTACTATGATACATCATGGAAGCGGCT 
GATCTTTTACATAAACCATGACTTCAAGCTGGAGAGAGAAGTCTGGAAGCGGCTACACGATGA 
AGGGATAATCCGGCTGTACCAGCGTCCTGGTCCCGGAACTGCCAAAGCCAAGAACTGACCGGG 
GCCAGGGCTGCCATGGTCTCCTTGCCTGCTCCAAGGCACAGGATACAGTGGGAATCTTGAGAC 
TCTTTGGCCATTTCCCATGGCTCAGACTAAGCTCCAAGCCCTTCAGGAGTTCCAAGGGAACAC 
TTGAACCATGGACAAGACTCTCTCAAGATGGCAAATGGCTAATTGAGGTTCTGAAGTTCTTCA 
GTACATTGCTGTAGGTCCTGAGGCCAGGGATTTTTAATTAAATGGGGTGATGGGTGGCCAATA 
CCACAATTCCTGCTGAAAAACACTCTTCCAGTCCAAAAGCTTCTTGATACAGAAAAAAGAGCC 
TGGATTTACAGAAACATATAGATCTGGTTTGAATTCCAGATCGAGTTTACAGTTGTGAAATCT 
TGAAGGTATTACTTAACTTCACTACAGATTGTCTAGAAGACCTTTCTAGGAGTTATCTGATTC 
TAGAAGGGTCTATACTTGTCCTTGTCTTTAAGCTATTTGACAACTCTACGTGTTGTAGAAAAC 
TGATAATAATACAAATGATTGTTGTCCATGGAAAGGCAAATAAATTTTCTACAGTGAAAAAAA 
AAAAAAAA 
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FIGURE 462 

MRSCLWRCRHLSQGVQWSLLLAVLVFFLFALPSFIKEPQTKPSRHQRTENIKERSLQSLAKPK 

SQAPTRARRTTIYAEPAPENNALNTQTQPKAHTTGDRGKEANQAPPEEQDKVPHTAQRAAWKS 

PEKEKTMVNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNGGQTRKLTASRTVSEKHQGKAA

TTAKTLIPKSQHRMLAPTGAVSTRTRQKGVTTAVIPPKEKKPQATPPPAPFQSPTTQRNQRLK 

AANFKSEPRWDFEEKYSFEIGGLQTTCPDSVKIKASKSLWLQKLFLPNLTLFLDSRHFNQSEW 

DRLEHFAPPFGFMELNYSLVQKVVTRFPPVPQQQLLLASLPAGSLRCITCAVVGNGGILNNSH

MGQEIDSHDYVFRLSGALIKGYEQDVGTRTSFYGFTAFSLTQSLLILGNRGFKNVPLGKDVRY 

LHFLEGTRDYEWLEALLMNQTVMSKNLFWFRHRPQEAFREALHMDRYLLLHPDFLRYMKNRFL 

RSKTLDGAHWRIYRPTTGALLLLTALQLCDQVSAYGFITEGHERFSDHYYDTSWKRLIFYINH

DFKLEREVWKRLHDEGIIRLYQRPGPGTAKAKN

Important features: 
Cytoplasmic Domain: 

amino acids 1-10 

Type II Transmembrane Domain: 

amino acids 11-35 


Lumenal catalytic Domain: 

amino acids 36-600 

Ribonucleotide Reductase small subunit Signature: 

amino acids 481-496 

N-glycosylation Sites: 

amino acids 300-303, 311-314, 331-334, 375-378, 460-463 
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FIGURE 463 



GGGGGAGCTAGGCCGGCGGCAGTGGTGGTGGCGGCGGCGCAAGGGTGAGGGCGGCCCCAGAAC 
CCCAGGTAGGTAGAGCAAGAAGATGGTGTTTCTGCCCCTCAAATGGTCCCTTGCAACCATGTC
ATTTCTACTTTCCTCACTGTTGGCTCTCTTAACTGTGTCCACTCCTTCATGGTGTCAGAGCAC 
TGAAGCATCTCCAAAACGTAGTGATGGGACACCATTTCCTTGGAATAAAATACGACTTCCTGA 
GTACGTCATCCCAGTTCATTATGATCTCTTGATCCATGCAAACCTTACCACGCTGACCTTCTG 
GGGAACCACGAAAGTAGAAATCACAGCCAGTCAGCCCACCAGCACCATCATCCTGCATAGTCA 
CCACCTGCAGATATCTAGGGCCACCCTCAGGAAGGGAGCTGGAGAGAGGCTATCGGAAGAACC 
CCTGCAGGTCCTGGAACACCCCCCTCAGGAGCAAATTGCACTGCTGGCTCCCGAGCCCCTCCT 
TGTCGGGCTCCCGTACACAGTTGTCATTCACTATGCTGGCAATCTTTCGGAGACTTTCCACGG 
ATTTTACAAAAGCACCTACAGAACCAAGGAAGGGGAACTGAGGATACTAGCATCAACACAATT 
TGAACCCACTGCAGCTAGAATGGCCTTTCCCTGCTTTGATGAACCTGCCTTCAAAGCAAGTTT 
CTCAATCAAAATTAGAAGAGAGCCAAGGCACCTAGCCATCTCCAATATGCCATTGGTGAAATC 
TGTGACTGTTGCTGAAGGACTCATAGAAGACCATTTTGATGTCAGTGTGAAGATGAGCACCTA 
TCTGGTGGCCTTCATCATTTCAGATTTTGAGTCTGTCAGCAAGATAACCAAGAGTGGAGTCAA 
GGTTTCTGTTTATGCTGTGCCAGACAAGATAAATCAAGCAGATTATGCACTGGATGCTGCGGT 
GACTCTTCTAGAATTTTATGAGGATTATTTCAGCATACCGTATCCCCTACCCAAACAAGATCT 
TGCTGCTATTCCCGACTTTCAGTCTGGTGCTATGGAAAACTGGGGACTGACAACATATAGAGA 
ATCTGCTCTGTTGTTTGATGCAGAAAAGTCTTCTGCATCAAGTAAGCTTGGCATCACAGTGAC 
TGTGGCCCATGAACTGGCCCACCAGTGGTTTGGGAACCTGGTCACTATGGAATGGTGGAATGA 
TCTTTGGCTAAATGAAGGATTTGCCAAATTTATGGAGTTTGTGTCTGTCAGTGTGACCCATCC 
TGAACTGAAAGTTGGAGATTATTTCTTTGGCAAATGTTTTGACGCAATGGAGGTAGATGCTTT 
AAATTCCTCACACCCTGTGTCTACACCTGTGGAAAATCCTGCTCAGATCCGGGAGATGTTTGA 
TGATGTTTCTTATGATAAGGGAGCTTGTATTCTGAATATGCTAAGGGAGTATCTTAGCGCTGA 
CGCATTTAAAAGTGGTATTGTACAGTATCTCCAGAAGCATAGCTATAAAAATACAAAAAACGA 
GGACCTGTGGGATAGTATGGCAAGTATTTGCCCTACAGATGGTGTAAAAGGGATGGATGGCTT 
TTGCTCTAGAAGTCAACATTCATCTTCATCCTCACATTGGCATCAGGAAGGGGTGGATGTGAA 
AACCATGATGAACACTTGGACACTGCAGAGGGGTTTTCCCCTAATAACCATCACAGTGAGGGG 
GAGGAATGTACACATGAAGCAAGAGCACTACATGAAGGGCTCTGACGGCGCCCCGGACACTGG 
GTACCTGTGGCATGTTCCATTGACATTCATCACCAGCAAATCCAACATGGTCCATCGATTTTT 
GCTAAAAACAAAAACAGATGTGCTCATCCTCCCAGAAGAGGTGGAATGGATCAAATTTAATGT 
GGGCATGAATGGCTATTACATTGTGCATTACGAGGATGATGGATGGGACTCTTTGACTGGCCT 
TTTAAAAGGAACACACACAGCAGTCAGCAGTAATGATCGGGCAAGTCTCATTAACAATGCATT 
TCAGCTCGTCAGCATTGGGAAGCTGTCCATTGAAAAGGCCTTGGATTTATCCCTGTACTTGAA 
ACATGAAACTGAAATTATGCCCGTGTTTCAAGGTTTGAATGAGCTGATTCCTATGTATAAGTT 
AATGGAGAAAAGAGATATGAATGAAGTGGAAACTCAATTCAAGGCCTTCCTCATCAGGCTGCT 
AAGGGACCTCATTGATAAGCAGACATGGACAGACGAGGGCTCAGTCTCAGAGCAAATGCTGCG 
GAGTGAACTACTACTCCTCGCCTGTGTGCACAACTATCAGCCGTGCGTACAGAGGGCAGAAGG 
CTATTTCAGAAAGTGGAAGGAATCCAATGGAAACTTGAGCCTGCCTGTCGACGTGACCTTGGC 
AGTGTTTGCTGTGGGGGCCCAGAGCACAGAAGGCTGGGATTTTCTTTATAGTAAATATCAGTT 
TTCTTTGTCCAGTACTGAGAATAGCCAAATTGAATTTGCCCTCTGCAGAACCCAAAATAAGGA
AAAGCTTCAATGGCTACTAGATGAAAGCTTTAAGGGAGATAAAATAAAAACTCAGGAGTTTCC 
ACAAATTCTTACACTCATTGGCAGGAACCCAGTAGGATACCCACTGGCCTGGCAATTTCTGAG 
GAAAAACTGGAACAAACTTGTACAAAAGTTTGAACTTGGCTCATCTTCCATAGCCCACATGGT 
AATGGGTACAACAAATCAATTCTCCACAAGAACACGGCTTGAAGAGGTAAAAGGATTCTTCAG 
CTCTTTGAAAGAAAATGGTTCTCAGCTCCGTTGTGTCCAACAGACAATTGAAACCATTGAAGA 
AAACATCGGTTGGATGGATAAGAATTTTGATAAAATCAGAGTGTGGCTGCAAAGTGAAAAGCT 
TGAACGTATGTAAAAATTCCTCCCTTGCCCGGTTCCTGTTATCTCT7y\TCACCAACATTTTGT 
TGAGTGTATTTTCAAACTAGAGATGGCTGTTTTGGCTCCAACTGGAGATACTTTTTTCCCTTC 
AACTCATTTTTTGACTATCCCTGTGAAAAGAATAGCTGTTAGTTTTTCATGAATGGGCTTTTT 
CATGAATGGGCTATCGCTACCATGTGTTTTGTTCATCACAGGTGTTGCCCTGCAACGTAAACC 
CAAGTGTTGGGTTCCCTGCCACAGAAGAATAAAGTACCTTATTCTTCTCAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 464 

MVFLPLKWSLATMSFLLSSLLALLTVSTPSWCQSTEASPKRSDGTPFPWNKIRLPEYVIPVHY
DLLIHANLTTLTFWGTTKVEITASQPTSTIILHSHHLQISRATLRKGAGERLSEEPLQVLEHP 
PQEQIALLAPEPLLVGLPYTVVIHYAGNLSETFHGFYKSTYRTKEGELRILASTQFEPTAARM
AFPCFDEPAFKASFSIKIRREPRHLAISNMPLVKSVTVAEGLIEDHFDVTVKMSTYLVAFIIS 
DFESVSKITKSGVKVSVYAVPDKINQADYALDAAVTLLEFYEDYFSIPYPLPKQDLAAIPDFQ 
SGAMENWGLTTYRESALLFDAEKSSASSKLGITVTVAHELAHQWFGNLVTMEWWNDLWLNEGF 
AKFMEFVSVSVTHPELKVGDYFFGKCFDAMEVDALNSSHPVSTPVENPAQIREMFDDVSYDKG 
ACILNMLREYLSADAFKSGIVQYLQKHSYKNTKNEDLWDSMASICPTDGVKGMDGFCSRSQHS 
SSSSHWHQEGVDVKTMMNTWTLQRGFPLITITVRGRNVHMKQEHYMKGSDGAPDTGYLWHVPL 
TFITSKSNMVHRFLLKTKTDVLILPEEVEWIKFNVGMNGYYIVHYEDDGWDSLTGLLKGTHTA 
VSSNDRASLINNAFQLVSIGKLSIEKALDLSLYLKHETEIMPVFQGLNELIPMYKLMEKRDMN 
EVETQFKAFLIRLLRDLIDKQTWTDEGSVSEQMLRSELLLLACVHNYQPCVQRAEGYFRKWKE 
SNGNLSLPVDVTLAVFAVGAQSTEGWDFLYSKYQFSLSSTEKSQIEFALCRTQNKEKLQWLLD 
ESFKGDKIKTQEFPQILTLIGRNPVGYPLAWQFLRKNWNKLVQKFELGSSSIAHMVMGTTNQF 
STRTRLEEVKGFFSSLKENGSQLRCVQQTIETIEENIGWMDKNFDKIRVWLQSEKLERM 

Important features: 
Signal peptide: 

amino acids 1-34 

N-glycosylation sites:

amino acids 70-74, 154-158, 414-418, 760-764, 901-905 

Neutral zinc metallopeptidases, zinc-binding region signature:

amino acids 350-360 
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FIGURE 465 

CAGCCACAGACGGGTCATGAGCGCGGTATTACTGCTGGCCCTCCTGGGGTTCATCCTCCCACT
GCCAGGAGTGCAGGCGCTGCTCTGCCAGTTTGGGACAGTTCAGCATGTGTGGAAGGTGTCCGA 
CCTACCCCGGCAATGGACCCCTAAGAACACCAGCTGCGACAGCGGCTTGGGGTGCCAGGACAC 
GTTGATGCTCATTGAGAGCGGACCCCAAGTGAGCCTGGTGCTCTCCAAGGGCTGCACGGAGGC 
CAAGGACCAGGAGCCCCGCGTCACTGAGCACCGGATGGGCCCCGGCCTCTCCCTGATCTCCTA 
CACCTTCGTGTGCCGCCAGGAGGACTTCTGCAACAACCTCGTTAACTCCCTCCCGCTTTGGGC 
CCCACAGCCCCCAGCAGACCCAGGATCCTTGAGGTGCCCAGTCTGCTTGTCTATGGAAGGCTG 
TCTGGAGGGGACAACAGAAGAGATCTGCCCCAAGGGGACCACACACTGTTATGATGGCCTCCT 
CAGGCTCAGGGGAGGAGGCATCTTCTCCAATCTGAGAGTCCAGGGATGCATGCCCCAGCCAGG 
TTGCAACCTGCTCAATGGGACACAGGAAATTGGGCCCGTGGGTATGACTGAGAACTGCAATAG 
GAAAGATTTTCTGACCTGTCATCGGGGGACCACCATTATGACACACGGAAACTTGGCTCAAGA 
ACCCACTGATTGGACCACATCGAATACCGAGATGTGCGAGGTGGGGCAGGTGTGTCAGGAGAC 
GCTGCTGCTCATAGATGTAGGACTCACATCAACCCTGGTGGGGACAAAAGGCTGCAGCACTGT 
TGGGGCTCAAAATTCCCAGAAGACCACCATCCACTCAGCCCCTCCTGGGGTGCTTGTGGCCTC 
CTATACCCACTTCTGCTCCTCGGACCTGTGCAATAGTGCCAGCAGCAGCAGCGTTCTGCTGAA 
CTCCCTCCCTCCTCAAGCTGCCCCTGTCCCAGGAGACCGGCAGTGTCCTACCTGTGTGCAGCC 
CCTTGGAACCTGTTCAAGTGGCTCCCCCCGAATGACCTGCCCCAGGGGCGCCACTCATTGTTA 
TGATGGGTACATTCATCTCTCAGGAGGTGGGCTGTCCACCAAAATGAGCATTCAGGGCTGCGT 
GGCCCAACCTTCCAGCTTCTTGTTGAACCACACCAGACAAATCGGGATCTTCTCTGCGCGTGA 
GAAGCGTGATGTGCAGCCTCCTGCCTCTCAGCATGAGGGAGGTGGGGCTGAGGGCCTGGAGTC 
TCTCACTTGGGGGGTGGGGCTGGCACTGGCCCCAGCGCTGTGGTGGGGAGTGGTTTGCCCTTC
CTGCTAACTCTATTACCCCCACGATTCTTCACCGCTGCTGACCACCCACACTCAACCTCCCTC 
TGACCTCATAACCTAATGGCCTTGGACACCAGATTCTTTCCCATTCTGTCCATGAATCATCTT 
CCCCACACACAATCATTCATATCTACTCACCTAACAGCAACACTGGGGAGAGCCTGGAGCATC 
CGGACTTGCCCTATGGGAGAGGGGACGCTGGAGGAGTGGCTGCATGTATCTGATAATACAGAC 
CCTGTCCTTTCA 
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FIGURE 466 

MSAVLLLALLGFILPLPGVQALLCQFGTVQHVWKVSDLPRQWTPKNTSCDSGLGCQDTLMLIE 
SGPQVSLVLSKGCTEAKDQEPRVTEHRMGPGLSLISYTFVCRQEDFCNNLVNSLPLWAPQPPA 
DPGSLRCPVCLSMEGCLEGTTEEICPKGTTHCYDGLLRLRGGGIFSNLRVQGCMPQPGCNLLN
GTQEIGPVGMTENCNRKDFLTCHRGTTIMTHGNLAQEPTDWTTSNTEMCEVGQVCQETLLLID 
VGLTSTLVGTKGCSTVGAQNSQKTTIHSAPPGVLVASYTHFCSSDLCNSASSSSVLLNSLPPQ 
AAPVPGDRQCPTCVQPLGTCSSGSPRMTCPRGATHCYDGYIHLSGGGLSTKMSIQGCVAQPSS 
FLLNHTRQIGIFSAREKRDVQPPASQHEGGGAEGLESLTWGVGLALAPALWWGVVCPSC
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FIGURE 467 

GAGGATTTGCCACAGCAGCGGATAGAGCAGGAGAGCACCACCGGAGCCCTTGAGACATCCTTG 
AGAAGAGCCACAGCATAAGAGACTGCCCTGCTTGGTGTTTTGCAGGATGATGGTGGCCCTTCG 
AGGAGCTTCTGCATTGCTGGTTCTGTTCCTTGCAGCTTTTCTGCCCCCGCCGCAGTGTACCCA 
GGACCCAGCCATGGTGCATTACATCTACCAGCGCTTTCGAGTCTTGGAGCAAGGGCTGGAAAA 
ATGTACCCAAGCAACGAGGGCATACATTCAAGAATTCCAAGAGTTCTCAAAAAATATATCTGT 
CATGCTGGGAAGATGTCAGACCTACACAAGTGAGTACAAGAGTGCAGTGGGTAACTTGGCACT 
GAGAGTTGAACGTGCCCAACGGGAGATTGACTACATACAATACCTTCGAGAGGCTGACGAGTG 
CATCGTATCAGAGGACAAGACACTGGCAGAAATGTTGCTCCAAGAAGCTGAAGAAGAGAAAAA 
GATCCGGACTCTGCTGAATGCAAGCTGTGACAACATGCTGATGGGCATAAAGTCTTTGAAAAT 
AGTGAAGAAGATGATGGACACACATGGCTCTTGGATGAAAGATGCTGTCTATAACTCTCCAAA 
GGTGTACTTATTAATTGGATCCAGAAACAACACTGTTTGGGAATTTGCAAACATACGGGCATT 
CATGGAGGATAACACCAAGCCAGCTCCCCGGAAGCAAATCCTAACACTTTCCTGGCAGGGAAC 
AGGCCAAGTGATCTACAAAGGTTTTCTATTTTTTCATAACCAAGCAACTTCTAATGAGATAAT 
CAAATATAACCTGCAGAAGAGGACTGTGGAAGATCGAATGCTGCTCCCAGGAGGGGTAGGCCG 
AGCATTGGTTTACCAGCACTCCCCCTCAACTTACATTGACCTGGCTGTGGATGAGCATGGGCT 
CTGGGCCATCCACTCTGGGCCAGGCACCCATAGCCATTTGGTTCTCACAAAGATTGAGCCGGG 
CACACTGGGAGTGGAGCATTCATGGGATACCCCATGCAGAAGCCAGGATGCTGAAGCCTCATT 
CCTCTTGTGTGGGGTTCTCTATGTGGTCTACAGTACTGGGGGCCAGGGCCCTCATCGCATCAC 
CTGCATCTATGATCCACTGGGCACTATCAGTGAGGAGGACTTGCCCAACTTGTTCTTCCCCAA 
GAGACC7VAGAAGTCACTCCATGATCCATTACAACCCCAGAGATAAGCAGCTCTATGCCTGGAA 
TGAAGGAAACCAGATCATTTACAAACTCCAGACAAAGAGAAAGCTGCCTCTGAAGTAATGCAT 
TACAGCTGTGAGAAAGAGCACTGTGGCTTTGGCAGCTGTTCTACAGGACAGTGAGGCTATAGC 
CCCTTCACAATATAGTATCCCTCTAATCACACACAGGTAGAGTGTGTAGAAGTGGAAATACGT
ATGCCTCCTTTCCCAAATGTCACTGCCTTAGGTATCTTCCAAGAGCTTAGATGAGAGCATATC 
ATCAGGAAAGTTTCAACAATGTCCATTACTCCCCCAAACCTCCTGGCTCTCAAGGATGACCAC 
ATTCTGATACAGCCTACTTCAAGCCTTTTGTTTTACTGCTCCCCAGCATTTACTGTAACTCTG 
CCATCTTCCCTCCCACAATTAGAGTTGTATGCCAGCCCCTAATATTCACCACTGGCTTTTCTC 
TCCCCTGGCCTTTGCTGAAGCTCTTCCCTCTTTTTCTU^ATGTCTATTGATATTCTCCCATTTT 
CACTGCCCAACTAAAATACTATTAATATTTCTTTCTTTTCTTTTCTTTTTTTTGAGACAAGGT 
CTCACTATGTTGCCCAGGCTGGTCTCAAACTCCAGAGCTCAAGAGATCCTCCTGCCTCAGCCT
CCTAAGTACCTGGGATTACAGGCATGTGCCACCACACCTGGCTTAAAATACTATTTCTTATTG 
AGGTTTAACCTCTATTTCCCCTAGCCCTGTCCTTCCACTAAGCTTGGTAGATGTAATAATAAA 
GTGAAAATATTAACATTTGAATATCGCTTTCCAGGTGTGGAGTGTTTGCACATCATTGAATTC 
TCGTTTCACCTTTGTGAAACATGCACAAGTCTTTACAGCTGTCATTCTAGAGTTTAGGTGAGT 
AACACAATTACAAAGTGAAAGATACAGCTAGAAAATACTACAAATCCCATAGTTTTTCCATTG
CCCAAGGAAGCATCAAATACGTATGTTTGTTCACCTACTCTTATAGTCAATGCGTTCATCGTT 
TCAGCCTAAAAATAATAGTCTGTCCCTTTAGCCAGTTTTCATGTCTGCACAAGACCTTTCAAT 
AGGCCTTTCAAATGATAATTCCTCCAGAAAACCAGTCTAAGGGTGAGGACCCCAACTCTAGCC 
TCCTCTTGTCTTGCTGTCCTCTGTTTCTCTCTTTCTGCTTTAAATTCAATAAAAGTGACACTG 
AGCAAAAAAAAAAAAAAA 
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FIGURE 468 

MMVALRGASALLVLFLAAFLPPPQCTQDPAMVHYIYQRFRVLEQGLEKCTQATRAYIQEFQEF 
SKNISVMLGRCQTYTSEYKSAVGNLALRVERAQREIDYIQYLREADECIVSEDKTLAEMLLQE 
AEEEKKIRTLLNASCDNMLMGIKSLKIVKKMMDTHGSWMKDAVYNSPKVYLLIGSRNNTVWEF 
ANIRAFMEDNTKPAPRKQILTLSWQGTGQVIYKGFLFFHNQATSNEIIKYNLQKRTVEDRMLL 
PGGVGRALVYQHSPSTYIDLAVDEHGLWAIHSGPGTHSHLVLTKIEPGTLGVEHSWDTPCRSQ 
DAEASFLLCGVLYVVYSTGGQGPHRITCIYDPLGTISEEDLPNLFFPKRPRSHSMIHYNPRDK
QLYAWNEGNQIIYKLQTKRKLPLK
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FIGURE 469 

TGGCCTCCCCAGCTTGCCAGGCACAAGGCTGAGCGGGAGGAAGCGAGAGGCATCTAAGCAGGC 
AGTGTTTTGCCTTCACCCCAAGTGACCATGAGAGGTGCCACGCGAGTCTCAATCATGCTCCTC 
CTAGTAACTGTGTCTGACTGTGCTGTGATCACAGGGGCCTGTGAGCGGGATGTCCAGTGTGGG 
GCAGGCACCTGCTGTGCCATCAGCCTGTGGCTTCGAGGGCTGCGGATGTGCACCCCGCTGGGG 
CGGGAAGGCGAGGAGTGCCACCCCGGCAGCCACAAGGTCCCCTTCTTCAGGAAACGCAAGCAC 
CACACCTGTCCTTGCTTGCCCAACCTGCTGTGCTCCAGGTTCCCGGACGGCAGGTACCGCTGC 
TCCATGGACTTGAAGAACATCAATTTTTAGGCGCTTGCCTGGTCTCAGGATACCCACCATCCT
TTTCCTGAGCACAGCCTGGATTTTTATTTCTGCCATGAAACCCAGCTCCCATGACTCTCCCAG 
TCCCTACACTGACTACCCTGATCTCTCTTGTCTAGTACGCACATATGCACACAGGCAGACATA 
CCTCCCATCATGACATGGTCCCCAGGCTGGCCTGAGGATGTCACAGCTTGAGGCTGTGGTGTG 
AAAGGTGGCCAGCCTGGTTCTCTTCCCTGCTCAGGCTGCCAGAGAGGTGGTAAATGGCAGAAA 
GGACATTCCCCCTCCCCTCCCCAGGTGACCTGCTCTCTTTCCTGGGCCCTGCCCCTCTCCCCA 
CATGTATCCCTCGGTCTGAATTAGACATTCCTGGGCACAGGCTCTTGGGTGCATTGCTCAGAG 
TCCCAGGTCCTGGCCTGACCCTCAGGCCCTTCACGTGAGGTCTGTGAGGACCAATTTGTGGGT 
AGTTCATCTTCCCTCGATTGGTTAACTCCTTAGTTTCAGACCACAGACTCAAGATTGGCTCTT 
CCCAGAGGGCAGCAGACAGTCACCCCAAGGCAGGTGTAGGGAGCCCAGGGAGGCCAATCAGCC 
CCCTGAAGACTCTGGTCCCAGTCAGCCTGTGGCTTGTGGCCTGTGACCTGTGACCTTCTGCCA 
GAATTGTCATGCCTCTGAGGCCCCCTCTTACCACACTTTACCAGTTAACCACTGAAGCCCCCA 
ATTCCCACAGCTTTTCCATTAAAATGCAAATGGTGGTGGTTCAATCTAATCTGATATTGACAT 
ATTAGAAGGCAATTAGGGTGTTTCCTTAAACAACTCCTTTCCAAGGATCAGCCCTGAGAGCAG 
GTTGGTGACTTTGAGGAGGGCAGTCCTCTGTCCAGATTGGGGTGGGAGCAAGGGACAGGGAGC 
AGGGCAGGGGCTGAAAGGGGCACTGATTCAGACCAGGGAGGCAACTACACACCAACATGCTGG 
CTTTAGAATAAAAGCACCAACTGAAAAAA 
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FIGURE 470 

MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEECHPG 
SHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF 

Important features:
Signal peptide: 

amino acids 1-19 

Tyrosine kinase phosphorylation site: 

amino acids 88-95 

N-myristoylation sites: 

amino acids 33-39, 35-41, 46-52 




FIGURE 471 

AGCGCCCGGGCGTCGGGGCGGTAAAAGGCCGGCAGAAGGGAGGCACTTGAGAAATGTCTTTCC 
TCCAGGACCCAAGTTTCTTCACCATGGGGATGTGGTCCATTGGTGCAGGAGCCCTGGGGGCTG 
CTGCCTTGGCATTGCTGCTTGCCAACACAGACGTGTTTCTGTCCAAGCCCCAGAAAGCGGCCC 
TGGAGTACCTGGAGGATATAGACCTGAAAACACTGGAGAAGGAACCAAGGACTTTCAAAGCAA 
AGGAGCTATGGGAAAAAAATGGAGCTGTGATTATGGCCGTGCGGAGGCCAGGCTGTTTCCTCT 
GTCGAGAGGAAGCTGCGGATCTGTCCTCCCTGAAAAGCATGTTGGACCAGCTGGGCGTCCCCC 
TCTATGCAGTGGTAAAGGAGCACATCAGGACTGAAGTGAAGGATTTCCAGCCTTATTTCAAAG
GAGAAATCTTCCTGGATGAAAAGAAAAAGTTCTATGGTCCACAAAGGCGGAAGATGATGTTTA 
TGGGATTTATCCGTCTGGGAGTGTGGTACAACTTCTTCCGAGCCTGGAACGGAGGCTTCTCTG 
GAAACCTGGAAGGAGAAGGCTTCATCCTTGGGGGAGTTTTCGTGGTGGGATCAGGAAAGCAGG 
GCATTCTTCTTGAGCACCGAGAAAAAGAATTTGGAGACAAAGTAAACCTACTTTCTGTTCTGG 
AAGCTGCTAAGATGATCAAACCACAGACTTTGGCCTCAGAGAAAAAATGATTGTGTGAAACTG 
CCCAGCTCAGGGATAACCAGGGACATTCACCTGTGTTCATGGGATGTATTGTTTCCACTCGTG 
TCCCTAAGGAGTGAGAAACCCATTTATACTCTACTCTCAGTATGGATTATTAATGTATTTTAA 
TATTCTGTTTAGGCCCACTAAGGCAAAATAGCCCCAAJ^ACAAGACTGACAAAAATCTGAAAAA 
CTAATGAGGATTATTAAGCTAAAACCTGGGAAATAGGAGGCTTAAAATTGACTGCCAGGCTGG 
GTGCAGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGAGCAAGTCACTTGAG
GTCGGGAGTTCGAGACCAGCCTGAGCAACATGGCGAAACCCCGTCTCTACTAAAAATACAAAA 
ATCACCCGGGTGTGGTGGCAGGCACCTGTAGTCCCAGCTACCCGGGAGGCTGAGGCAGGAGAA 
TCACTTGAACCTGGGAGGTGGAGGTTGCGGTGAGCTGAGATCACACCACTGTATTCCAGCCTG 
GGTGACTGAGACTCTAACTAA 
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FIGURE 472 

MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEKEPRT 
FKAKELWEKNGAVIMAVRRPGCFLCREEAADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDFQP
YFKGEIFLDEKKKFYGPQRRKMMFMGFIRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFVVGS
GKQGILLEHREKEFGDKVNLLSVLEAAKMIKPQTLASEKK 
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FIGURE 473 

AATATATCATCTATTTATCATTAATCAATAATGTATTCTTTTATTCCAATAACATTTGGGTTT 
TGGGATTTTAATTTTCAAACACAGCAGAATGACATTTTTTCTGTCACTATTATTATTGTTGGT 
ATGTGAAGCTATTTGGAGATCCAATTCAGGAAGCAACACATTGGAGAATGGCTACTTTCTATC 
AAGAAATAAAGAGAACCACAGTCAACCCACACAATCATCTTTAGAAGACAGTGTGACTCCTAC 
CAAAGCTGTCAAAACCACAGGCAAGGGCATAGTTAAAGGACGGAATCTTGACTCAAGAGGGTT 
AATTCTTGGTGCTGAAGCCTGGGGCAGGGGTGTAAAGAAAAACACTTAGATTCAATGATTGTA 
AATTTAAGGCAAATACACATATTAGTATTACCTTAGTGTAATGTATCCCTGTCATATATACAA 
TAAGGTGAAATTATAAGTACCCTATGCAGTTGGCTGGACAGTTCTAAATTGGACTTTATTAAT 
TTTTAAAATCAGTAACTGATTTATCACTGGCTATGTGCTTAGATCTACAGGAGATCATATAAT 
TTGATACAAATAAAAGAAAAGTGTTCTCTCCCCTTACAGAATTGACATTTTAAATGCGATACA 
GTTAGAATAGGAAATATGACATTAGAAAGGAAGAATGACAGGGAGAAAGGAAAGAAGGGAAAA 
TGTTGCCAAGGAAAAAAAAA 




FIGURE 474 

MTFFLSLLLLLVCEAIWRSNSGSNTLENGYFLSRNKENHSQPTQSSLEDSVTPTKAVKTTGKG 
IVKGRNLDSRGLILGAEAWGRGVKKNT 
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FIGURE 475 

GACAGTGGAGGGCAGTGGAGAGGACCGCGCTGTCCTGCTGTCACCAAGAGCTGGAGACACCAT 

CTCCCACCGAGAGTCATGGCCCCATTGGCCCTGCACCTCCTCGTCCTCGTCCCCATCCTCCTC 

AGCCTGGTGGCCTCCCAGGACTGGAAGGCTGAACGCAGCCAAGACCCCTTCGAGAAATGCATG 

CAGGATCCTGACTATGAGCAGCTGCTCAAGGTGGTGACCTGGGGGCTCAATCGGACCCTGAAG 

CCCCAGAGGGTGATTGTGGTTGGCGCTGGTGTGGCCGGGCTGGTGGCCGCCAAGGTGCTCAGC 

GATGCTGGACACAAGGTCACCATCCTGGAGGCAGATAACAGGATCGGGGGCCGCATCTTCACC 

TACCGGGACCAGAACACGGGCTGGATTGGGGAGCTGGGAGCCATGCGCATGCCCAGCTCTCAC 

AGGATCCTCCACAAGCTCTGCCAGGGCCTGGGGCTCAACCTGACCAAGTTCACCCAGTACGAC 

AAGAACACGTGGACGGAGGTGCACGAAGTGAAGCTGCGCAACTATGTGGTGGAGAAGGTGCCC 

GAGAAGCTGGGCTACGCCTTGCGTCCCCAGGAAAAGGGCCACTCGCCCGAAGACATCTACCAG 

ATGGCTCTCAACCAGGCCCTCAAAGACCTCAAGGCACTGGGCTGCAGAAAGGCGATGAAGAAG 

TTTGAAAGGCACACGCTCTTGGAATATCTTCTCGGGGAGGGGAACCTGAGCCGGCCGGCCGTG 

CAGCTTCTGGGAGACGTGATGTCCGAGGATGGCTTCTTCTATCTCAGCTTCGCCGAGGCCCTC 

CGGGCCCACAGCTGCCTCAGCGACAGACTCCAGTACAGCCGCATCGTGGGTGGCTGGGACCTG 

CTGCCGCGCGCGCTGCTGAGCTCGCTGTCCGGGCTTGTGCTGTTGAACGCGCCCGTGGTGGCG 

ATGACCCAGGGACCGCACGATGTGCACGTGCAGATCGAGACCTCTCCCCCGGCGCGGAATCTG 

AAGGTGCTGAAGGCCGACGTGGTGCTGCTGACGGCGAGCGGACCGGCGGTGAAGCGCATCACC 

TTCTCGCCGCCGCTGCCCCGCCACATGCAGGAGGCGCTGCGGAGGCTGCACTACGTGCCGGCC 

ACCAAGGTGTTCCTAAGCTTCCGCAGGCCCTTCTGGCGCGAGGAGCACATTGAAGGCGGCCAC 

TCAAACACCGATCGCCCGTCGCGCATGATTTTCTACCCGCCGCCGCGCGAGGGCGCGCTGCTG 

CTGGCCTCGTACACGTGGTCGGACGCGGCGGCAGCGTTCGCCGGCTTGAGCCGGGAAGAGGCG 

TTGCGCTTGGCGCTCGACGACGTGGCGGCATTGCACGGGCCTGTCGTGCGCCAGCTCTGGGAC 

GGCACCGGCGTCGTCAAGCGTTGGGCGGAGGACCAGCACAGCCAGGGTGGCTTTGTGGTACAG 

CCGCCGGCGCTCTGGCAAACCGAAAAGGATGACTGGACGGTCCCTTATGGCCGCATCTACTTT 

GCCGGCGAGCACACCGCCTACCCGCACGGCTGGGTGGAGACGGCGGTCAAGTCGGCGCTGCGC 

GCCGCCATCAAGATCAACAGCCGGAAGGGGCCTGCATCGGACACGGCCAGCCCCGAGGGGCAC

GCATCTGACATGGAGGGGCAGGGGCATGTGCATGGGGTGGCCAGCAGCCCCTCGCATGACCTG 

GCAAAGGAAGAAGGCAGCCACCCTCCAGTCCAAGGCCAGTTATCTCTCCAAAACACGACCCAC

ACGAGGACCTCGCATTAAAGTATTTTCGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAA 
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FIGURE 476 

MAPLALHLLVLVPILLSLVASQDWKAERSQDPFEKCMQDPDYEQLLKVVTWGLNRTLKPQRVI 
VVGAGVAGLVAAKVLSDAGHKVTILEADNRIGGRIFTYRDQNTGWIGELGAMRMPSSHRILHK
LCQGLGLNLTKFTQYDKNTWTEVHEVKLRNYVVEKVPEKLGYALRPQEKGHSPEDIYQMALNQ 
ALKDLKALGCRKAMKKFERHTLLEYLLGEGNLSRPAVQLLGDVMSEDGFFYLSFAEALRAHSC 
LSDRLQYSRIVGGWDLLPRALLSSLSGLVLLNAPVVAMTQGPHDVHVQIETSPPARNLKVLKA
DVVLLTASGPAVKRITFSPPLPRHMQEALRRLHYVPATKVFLSFRRPFWREEHIEGGHSNTDR
PSRMIFYPPPREGALLLASYTWSDAAAAFAGLSREEALRLALDDVAALHGPVVRQLWDGTGVV
KRWAEDQHSQGGFVVQPPALWQTEKDDWTVPYGRIYFAGEHTAYPHGWVETAVKSALRAAIKI
NSRKGPASDTASPEGHASDMEGQGHVHGVASSPSHDLAKEEGSHPPVQGQLSLQNTTHTRTSH 

Important features:
Signal peptide: 

amino acids 1-21 
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FIGURE 477 



CTGACATGGCCTGACTCGGGACAGCTCAGAGCAGGGCAGAACTGGGGACACTCTGGGCCGGCCTTCTGCCTGCAT 
GGACGCTCTGAAGCCACCCTGTCTCTGGAGGAACCACGAGCGAGGGAAGAAGGACAGGGACTCGTGTGGCAGGAA 
GAACTCAGAGCCGGGAAGCCCCCATTCACTAGAAGCACTGAGAGATGCGGCCCCCTCGCAGGGTCTGAATTTCCT 
GCTGCTGTTCACAAAGATGCTTTTTATCTTTAACTTTTTGTTTTCCCCACTTCCGACCCCGGCGTTGATCTGCAT 
CCTGACATTTGGAGCTGCCATCTTCTTGTGGCTGATCACCAGACCTCAACCCGTCTTACCTCTTCTTGACCTGAA 
CAATCAGTCTGTGGGAATTGAGGGAGGAGCACGGAAGGGGGTTTCCCAGAAGAACAATGACCTAACAAGTTGCTG 
CTTCTCAGATGCCAAGACTATGTATGAGGTTTTCCAAAGAGGACTCGCTGTGTCTGACAATGGGCCCTGCTTGGG 
ATATAGAAAACCAAACCAGCCCTACAGATGGCTATCTTACAAACAGGTGTCTGATAGAGCAGAGTACCTGGGTTC 
CTGTCTCTTGCATAAAGGTTATAAATCATCACCAGACCAGTTTGTCGGCATCTTTGCTCAGAATAGGCCAGAGTG 
GATCATCTCCGAATTGGCTTGTTACACGTACTCTATGGTAGCTGTACCTCTGTATGACACCTTGGGACCAGAAGC 
CATCGTACATATTGTCAACAAGGCTGATATCGCCATGGTGATCTGTGACACACCCCAAAAGGCATTGGTGCTGAT 
AGGGAATGTAGAGAAAGGCTTCACCCCGAGCCTGAAGGTGATCATCCTTATGGACCCCTTTGATGATGACCTGAA 
GCAAAGAGGGGAGAAGAGTGGAATTGAGATCTTATCCCTATATGATGCTGAGAACCTAGGCAAAGAGCACTTCAG 
AAAACCTGTGCCTCCTAGCCCAGAAGACCTGAGCGTCATCTGCTTCACCAGTGGGACCACAGGTGACCCCAAAGG 
AGCCATGATAACCCATCAAAATATTGTTTCAAATGCTGCTGCCTTTCTCAAATGTGTGGAGCATGCTTATGAGCC 
CACTCCTGATGATGTGGCCATATCCTACCTCCCTCTGGCTCATATGTTTGAGAGGATTGTACAGGCTGTTGTGTA 
CAGCTGTGGAGCCAGAGTTGGATTCTTCCAAGGGGATATTCGGTTGCTGGCTGACGACATGAAGACTTTGAAGCC 
CACATTGTTTCCCGCGGTGCCTCGACTCCTTAACAGGATCTACGATAAGGTACAAAATGAGGCCAAGACACCCTT 
GAAGAAGTTCTTGTTGAAGCTGGCTGTTTCCAGTAAATTCAAAGAGCTTCAAAAGGGTATCATCAGGCATGATAG 
TTTCTGGGACAAGCTCATCTTTGCAAAGATCCAGGACAGCCTGGGCGGAAGGGTTCGTGTAATTGTCACTGGAGC 
TGCCCCCATGTCCACTTCAGTCATGACATTCTTCCGGGCAGCAATGGGATGTCAGGTGTATGAAGCTTATGGTCA 
AACAGAATGCACAGGTGGCTGTACATTTACATTACCTGGGGACTGGACATCAGGTCACGTTGGGGTGCCCCTGGC 
TTGCAATTACGTGAAGCTGGAAGATGTGGCTGACATGAACTACTTTACAGTGAATAATGAAGGAGAGGTCTGCAT 
CAAGGGTACAAACGTGTTCAAAGGATACCTGAAGGACCCTGAGAAGACACAGGAAGCCCTGGACAGTGATGGCTG 
GCTTCACACAGGAGACATTGGTCGCTGGCTCCCGAATGGAACTCTGAAGATCATCGACCGTAAAAAGAACATTTT 
CAAGCTGGCCCAAGGAGAATACATTGCACCAGAGAAGATAGAAAATATCTACAACAGGAGTCAACCAGTGTTACA 
AATTTTTGTACACGGGGAGAGCTTACGGTCATCCTTAGTAGGAGTGGTGGTTCCTGACACAGATGTACTTCCCTC 
ATTTGCAGCCAAGCTTGGGGTGAAGGGCTCCTTTGAGGAACTGTGCCAAAACCAAGTTGTAAGGGAAGCCATTTT 
AGAAGACTTGCAGAAAATTGGGAAAGAAAGTGGCCTTAAAACTTTTGAACAGGTCAAAGCCATTTTTCTTCATCC 
AGAGCCATTTTCCATTGAAAATGGGCTCTTGACACCAACATTGAAAGCAAAGCGAGGAGAGCTTTCCAAATACTT 
TCGGACCCAAATTGACAGCCTGTATGAGCACATCCAGGATTAGGATAAGGTACTTAAGTACCTGCCGGCCCACTG 
TGCACTGCTTGTGAGAAAATGGATTAAAAACTATTCTTACATTTGTTTTGGCTTTCCTCCTATTTTTTTTTAACC 
TGTTAAACTCTAAAGCCATAGCTTTTGTTTTATATTGAGACATATAATGTGTAAACTTAGTTCCCAAATAAATCA 
ATCCTGTCTTTCCCATCTTCGATGTTGCTAATATTAAGGCTTCAGGGCTACTTTTATCAACATGCCTGTCTTCAA 
GATCCCAGTTTATGTTCTGTGTCCTTCCTCATGATTTCCAACCTTAATACTATTAGTAACCACAAGTTCAAGGGT 
CAAAGGGACCCTCTGTGCCTTCTTCTTTGTTTTGTGATAAACATAACTTGCCAACAGTCTCTATGCTTATTTACA 
TCTTCTACTGTTCAAACTAAGAGATTTTTAAATTCTGAAAAACTGCTTACAATTCATGTTTTCTAGCCACTCCAC 
AAACCACTAAAATTTTAGTTTTAGCCTATCACTCATGTCAATCATATCTATGAGACAAATGTCTCCGATGCTCTT 
CTGCGTAAATTAAATTGTGTACTGAAGGGAAAAGTTTGATCATACCAAACATTTCCTAAACTCTCTAGTTAGATA 
TCTGACTTGGGAGTATTAAAAATTGGGTCTATGACATACTGTCCAAAAGGAATGCTGTTCTTAAAGCATTATTTA 
CAGTAGGAACTGGGGAGTAAATCTGTTCCCTACAGTTTGCTGCTGAGCTGGAAGCTGTGGGGGAAGGAGTTGACA 
GGTGGGCGCAGTGAACTTTTCCAGTAAATGAAGCAAGCACTGAATAAAAACCTCCTGAACTGGGAACAAAGATCT 
ACAGGCAAGCAAGATGCCCACACAACAGGCTTATTTTCTGTGAAGGAACCAACTGATCTCCCCCACCCTTGGATT 
AGAGTTCCTGCTCTACCTTACCCACAGATAACACATGTTGTTTCTACTTGTAAATGTAAAGTCTTTAAAATAAAC 
TATTACAGATAAAAAA
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FIGURE 478 

MDALKPPCLWRNHERGKKDRDSCGRKNSEPGSPHSLEALRDAAPSQGLNFLLLFTKMLFIFNF 
LFSPLPTPALICILTFGAAIFLWLITRPQPVLPLLDLNNQSVGIEGGARKGVSQKNNDLTSCC
FSDAKTMYEVFQRGLAVSDNGPCLGYRKPNQPYRWLSYKQVSDRAEYLGSCLLHKGYKSSPDQ 
FVGIFAQNRPEWIISELACYTYSMVAVPLYDTLGPEAIVHIVNKADIAMVICDTPQKALVLIG 
NVEKGFTPSLKVIILMDPFDDDLKQRGEKSGIEILSLYDAENLGKEHFRKPVPPSPEDLSVIC 
FTSGTTGDPKGAMITHQNIVSNAAAFLKCVEHAYEPTPDDVAISYLPLAHMFERIVQAVVYSC
GARVGFFQGDIRLLADDMKTLKPTLFPAVPRLLNRIYDKVQNEAKTPLKKFLLKLAVSSKFKE 
LQKGIIRHDSFWDKLIFAKIQDSLGGRVRVIVTGAAPMSTSVMTFFRAAMGCQVYEAYGQTEC 
TGGCTFTLPGDWTSGHVGVPLACNYVKLEDVADMNYFTVNNEGEVCIKGTNVFKGYLKDPEKT 
QEALDSDGWLHTGDIGRWLPNGTLKIIDRKKNIFKLAQGEYIAPEKIENIYNRSQPVLQIFVH 
GESLRSSLVGVVVPDTDVLPSFAAKLGVKGSFEELCQNQVVREAILEDLQKIGKESGLKTFEQ
VKAIFLHPEPFSIENGLLTPTLKAKRGELSKYFRTQIDSLYEHIQD 

Important features: 

Type II transmembrane domain: 

amino acids 61-80 

Putative AMP-binding domain signature. 

amino acids 314-325 

N-glycosylation site. 

amino acids 102-105, 588-591 and 619-622 




FIGURE 479 

GGAGGCGGAGGCCGCGGCGAGCCGGGCCGAGCAGTGAGGGCCCTAGCGGGGCCCGAGCGGGGC 
CCGGGGCCCCTAAGCCATTCCTGAAGTCATGGGCTGGCCAGGACATTGGTGACCCGCCAATCC 
GGTATGGACGACTGGAAGCCCAGCCCCCTCATCAAGCCCTTTGGGGCTCGGAAGAAGCGGAGC 
TGGTACCTTACCTGGAAGTATAAACTGACAAACCAGCGGGCCCTGCGGAGATTCTGTCAGACA 
GGGGCCGTGCTTTTCCTGCTGGTGACTGTCATTGTCAATATCAAGTTGATCCTGGACACTCGG 
CGAGCCATCAGTGAAGCCAATGAAGACCCAGAGCCAGAGCAAGACTATGATGAGGCCCTAGGC 
CGCCTGGAGCCCCCACGGCGCAGAGGCAGTGGTCCCCGGCGGGTCCTGGACGTAGAGGTGTAT 
TCAAGTCGCAGCAAAGTATATGTGGCAGTGGATGGCACCACGGTGCTGGAGGATGAGGCCCGG 
GAGCAGGGCCGGGGCATCCATGTCATTGTCCTCAACCAGGCCACGGGCCACGTGATGGCAAAA 
CGTGTGTTTGACACGTACTCACCTCATGAGGATGAGGCCATGGTGCTATTCCTCAACATGGTA 
GCGCCCGGCCGAGTGCTCATCTGCACTGTCAAGGATGAGGGCTCCTTCCACCTCAAGGACACA 
GCCAAGGCTCTGCTGAGGAGCCTGGGCAGCCAGGCTGGCCCTGCCCTGGGCTGGAGGGACACA 
TGGGCCTTCGTGGGACGAAAAGGAGGTCCTGTCTTCGGGGAGAAACATTCTAAGTCACCTGCC 
CTCTCTTCCTGGGGGGACCCAGTCCTGCTGAAGACAGATGTGCCATTGAGCTCAGCAGAAGAG 
GCAGAGTGCCACTGGGCAGACACAGAGCTGAACCGTCGCCGCCGGCGCTTCTGCAGCAAAGTT 
GAGGGCTATGGAAGTGTATGCAGCTGCAAGGACCCCACACCCATCGAGTTCAGCCCTGACCCA 
CTCCCAGACAACAAGGTCCTCAATGTGCCTGTGGCTGTCATTGCAGGGAACCGACCCAATTAC 
CTGTACAGGATGCTGCGCTCTCTGCTTTCAGCCCAGGGGGTGTCTCCTCAGATGATAACAGTT 
TTCATTGACGGCTACTATGAGGAACCCATGGATGTGGTGGCACTGTTTGGTCTGAGGGGCATC 
CAGCATACTCCCATCAGCATCAAGAATGCCCGCGTGTCTCAGCACTACAAGGCCAGCCTCACT 
GCCACTTTCAACCTGTTTCCGGAGGCCAAGTTTGCTGTGGTTCTGGAAGAGGACCTGGACATT 
GCTGTGGATTTTTTCAGTTTCCTGAGCCAATCCATCCACCTACTGGAGGAGGATGACAGCCTG 
TACTGCATCTCTGCCTGGAATGACCAGGGGTATGAACACACGGCTGAGGACCCAGCACTACTG 
TACCGTGTGGAGACCATGCCTGGGCTGGGCTGGGTGCTCAGGAGGTCCTTGTACAAGGAGGAG 
CTTGAGCCCAAGTGGCCTACACCGGAAAAGCTCTGGGATTGGGACATGTGGATGCGGATGCCT 
GAACAACGCGGGGGCCGAGAGTGCATCATCCCTGACGTTTCCCGATCCTACCACTTTGGCATC 
GTCGGCCTCAACATGAATGGCTACTTTCACGAGGCCTACTTCAAGAAGCACAAGTTCAACACG 
GTTCCAGGTGTCCAGCTCAGGAATGTGGACAGTCTGAAGAAAGAAGCTTATGAAGTGGAAGTT 
CACAGGCTGCTCAGTGAGGCTGAGGTTCTGGACCACAGCAAGAACCCTTGTGAAGACTCTTTC 
CTGCCAGACACAGAGGGCCACACCTACGTGGCCTTTATTCGAATGGAGAAAGATGATGACTTC 
ACCACCTGGACCCAGCTTGCCAAGTGCCTCCATATCTGGGACCTGGATGTGCGTGGCAACCAT 
CGGGGCCTGTGGAGATTGTTTCGGAAGAAGAACCACTTCCTGGTGGTGGGGGTCCCGGCTTCC 
CCCTACTCAGTGAAGAAGCCACCCTCAGTCACCCCAATTTTCCTGGAGCCACCCCCAAAGGAG 
GAGGGAGCCCCAGGAGCCCCAGAACAGACATGAGACCTCCTCCAGGACCCTGCGGGGCTGGGT 
ACTGTGTACCCCCAGGCTGGCTAGCCCTTCCCTCCATCCTGTAGGATTTTGTAGATGCTGGTA 
GGGGCTGGGGCTACCTTGTTTTTAACATGAGACTTAATTACTAACTCCAAGGGGAGGGTTCCC 
CTGCTCCAACACCCCGTTCCTGAGTTAAAAGTCTATTTATTTACTTCCTTGTTGGAGAAGGGC 
AGGAGAGTACCTGGGAATCATTACGATCCCTAGCAGCTCATCCTGeCCTTTGAATACCCTCAC 
TTTCCAGGCCTGGCTCAGAATCTAACCTATTTATTGACTGTCCTGAGGGCCTTGAAAACAGGC 
CGAACCTGGAGGGCCTGGATTTCTTTTTGGGCTGGAATGCTGCCCTGAGGGTGGGGCTGGCTC 
TTACTCAGGAAACTGCTGTGCCCAACCCATGGACAGGCCCAGCTGGGGCCCACATGCTGACAC 
AGACTCACTCAGAGACCCTTAGACACTGGACCAGGCCTCCTCTCAGCCTTCTCTTTGTCCAGA 
TTTCCAAAGGTGGATAAGTTGGTCATTGATTAAAAAAGGAGAAGCCCTCTGGGAAAAAAAAAA 
AAAAAAAAAAAAAAAA 

FIGURE 480

MDDWKPSPLIKPFGARKKRSWYLTWKYKLTNQRALRRFCQTGAVLFLLVTVIVNIKLILDTRR 
AISEANEDPEPEQDYDEALGRLEPPRRRGSGPRRVLDVEVYSSRSKVYVAVDGTTVLEDEARE 
QGRGIHVIVLNQATGHVMAKRVFDTYSPHEDEAMVLFLNMVAPGRVLICTVKDEGSFHLKDTA 
KALLRSLGSQAGPALGWRDTWAFVGRKGGPVFGEKHSKSPALSSWGDPVLLKTDVPLSSAEEA 
ECHWADTELNRRRRRFCSKVEGYGSVCSCKDPTPIEFSPDPLPDNKVLNVPVAVIAGNRPNYL 
YRMLRSLLSAQGVSPQMITVFIDGYYEEPMDVVALFGLRGIQHTPISIKNARVSQHYKASLTA 
TFNLFPEAKFAVVLEEDLDIAVDFFSFLSQSIHLLEEDDSLYCISAWNDQGYEHTAEDPALLY
RVETMPGLGWVLRRSLYKEELEPKWPTPEKLWDWDMWMRMPEQRRGRECIIPDVSRSYHFGIV 
GLNMNGYFHEAYFKKHKFNTVPGVQLRNVDSLKKEAYEVEVHRLLSEAEVLDHSKNPCEDSFL 
PDTEGHTYVAFIRMEKDDDFTTWTQLAKCLHIWDLDVRGNHRGLWRLFRKKNHFLVVGVPASP
YSVKKPPSVTPIFLEPPPKEEGAPGAPEQT 

Important features: 
Transmembrane domain: 

amino acids 38-55 

Homologous region to Mouse GNT1 

amino acids 229-660 

FIGURE 481

GAAAGAATGTTGTGGCTGCTCTTTTTTCTGGTGACTGCCATTCATGCTGAACTCTGTCAACCA 
GGTGCAGAAAATGCTTTTAAAGTGAGACTTAGTATCAGAACAGCTCTGGGAGATAAAGCATAT 
GCCTGGGATACCAATGAAGAATACCTCTTCAAAGCGATGGTAGCTTTCTCCATGAGAAAAGTT 
CCCAACAGAGAAGCAACAGAAATTTCCCATGTCCTACTTTGCAATGTAACCCAGAGGGTATCA 
TTCTGGTTTGTGGTTACAGACCCTTCAAAAAATCACACCCTTCCTGCTGTTGAGGTGCAATCA 
GCCATAAGAATGAACAAGAACCGGATCAACAATGCCTTCTTTCTAAATGACCAAACTCTGGAA 
TTTTTAAAAATCCCTTCCACACTTGCACCACCCATGGACCCATCTGTGCCCATCTGGATTATT 
ATATTTGGTGTGATATTTTGCATCATCATAGTTGCAATTGCACTACTGATTTTATCAGGGATC 
TGGCAACGTAGAAGAAAGAACAAAGAACCATCTGAAGTGGATGACGCTGAAGATAAGTGTGAA 
AACATGATCACAATTGAAAATGGCATCCCCTCTGATCCCCTGGACATGAAGGGGGGCATATTA
ATGATGCCTTCATGACAGAGGATGAGAGGCTCACCCCTCTCTGAAGGGCTGTTGTTCTGCTTC 
CTCAAGAAATTAAACATTTGTTTCTGTGTGACTGCTGAGCATCCTGAAATACCAAGAGCAGAT 
CATATATTTTGTTTCACCATTCTTCTTTTGTAATAAATTTTGT^ATGTGCTTGAAAGTGAAAAG 
CAATCAATTATACCCACCAACACCACTGAAATCATAAGCTATTCACGACTCAAAATATTCTAA 
AATATTTTTCTGACAGTATAGTGTATAAATGTGGTCATGTGGTATTTGTAGTTATTGATTTAA
GCATTTTTAGAAATAAGATCAGGCATATGTATATATTTTCACACTTCAAAGACCTAAGGAAAA 
ATAAATTTTCCAGTGGAGAATACATATAATATGGTGTAGAAATCATTGAAAATGGATCCTTTT 
TGACGATCACTTATATCACTCTGTATATGACTAAGTAAACAAAAGTGAGAAGTAATTATTGTA
AATGGATGGATAAAAATGGAATTACTCATATACAGGGTGGAATTTTATCCTGTTATCACACCA 
ACAGTTGATTATATATTTTCTGAATATCAGCCCCTAATAGGACAATTCTATTTGTTGACCATT 
TCTACAATTTGTAAAAGTCCAATCTGTGCTAACTTAATAAAGTAATAATCATCTCTTTTTAAA 
AAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 482 

MLWLLFFLVTAIHAELCQPGAENAFKVRLSIRTALGDKAYAWDTNEEYLFKAMVAFSMRKVPN 
REATEISHVLLCNVTQRVSFWFVVTDPSKNHTLPAVEVQSAIRMNKNRINNAFFLNDQTLEFL
KIPSTLAPPMDPSVPIWIIIFGVIFCIIIVAIALLILSGIWQRRRKNKEPSEVDDAEDKCENM
ITIENGIPSDPLDMKGGILMMPS



FIGURE 483 

CGTCTCTGCGTTCGCCATGCGTCCCGGGGCGCCAGGGCCACTCTGGCCTCTGCCCTGGGGGGC 
CCTGGCTTGGGCCGTGGGCTTCGTGAGCTCCATGGGCTCGGGGAACCCCGCGCCCGGTGGTGT 
TTGCTGGCTCCAGCAGGGCCAGGAGGCCACCTGCAGCCTGGTGCTCCAGACTGATGTCACCCG 
GGCCGAGTGCTGTGCCTCCGGCAACATTGACACCGCCTGGTCCAACCTCACCCACCCGGGGAA 
CAAGATCAACCTCCTCGGCTTCTTGGGCCTTGTCCACTGCCTTCCCTGCAAAGATTCGTGCGA 
CGGCGTGGAGTGCGGCCCGGGCAAGGCGTGCCGCATGCTGGGGGGCCGCCCGCGCTGCGAGTG 
CGCGCCCGACTGCTCGGGGCTCCCGGCGCGGCTGCAGGTCTGCGGCTCAGACGGCGCCACCTA 
CCGCGACGAGTGCGAGCTGCGCGCCGCGCGCTGGCGCGGCCACCCGGACCTGAGCGTCATGTA 
CCGGGGCCGCTGCCGCAAGTCCTGTGAGCACGTGGTGTGCCCGCGGCCACAGTCGTGCGTCGT
GGACCAGACGGGCAGCGCCCACTGCGTGGTGTGTCGAGCGGCGCCCTGCCCTGTGCCCTCCAG 
CCCCGGCCAGGAGCTTTGCGGCAACAACAACGTCACCTACATCTCCTCGTGCCACATGCGCCA 
GGCCACCTGCTTCCTGGGCCGCTCCATCGGCGTGCGCCACGCGGGCAGCTGCGCAGGCACCCC 
TGAGGAGCCGCCAGGTGGTGAGTCTGCAGAAGAGGAAGAGAACTTCGTGTGAGCCTGCAGGAC 
AGGCCTGGGCCTGGTGCCCGAGGCCCCCCATCATCCCCTGTTATTTATTGCCACAGCAGAGTC 
TAATTTATATGCCACGGACACTCCTTAGAGCCCGGATTCGGACCACTTGGGGATCCCAGAACC 
TCCCTGACGATATCCTGGAAGGACTGAGGAAGGGAGGCCTGGGGGCCGGCTGGTGGGTGGGAT 
AGACCTGCGTTCCGGACACTGAGCGCCTGATTTAGGGCCCTTCTCTAGGATGCCCCAGCCCCT 
ACCCTAAGACCTATTGCCGGGGAGGATTCCACACTTCCGCTCCTTTGGGGATT^AACCTATTAA 
TTATTGCTACTATCAAGAGGGCTGGGCATTCTCTGCTGGTAATTCCTGAAGAGGCATGACTGC 
TTTTCTCAGCCCCAAGCCTCTAGTCTGGGTGTGTACGGAGGGTCTAGCCTGGGTGTGTACGGA 
GGGTCTAGCCTGGGTGAGTACGGAGGGTCTAGCCTGGGTGAGTACGGAGGGTCTAGCCTGGGT 
GAGTACGGAGGGTCTAGCCTGGGTGTGTATGGAGGATCTAGCCTGGGTGAGTATGGAGGGTCT 
AGCCTGGGTGAGTATGGAGGGTCTAGCCTGGGTGTGTATGGAGGGTCTAGCCTGGGTGAGTAT 
GGAGGGTCTAGCCTGGGTGTGTATGGAGGGTCTAGCCTGGGTGAGTATGGAGGGTCTAGCCTG 
GGTGTGTACGGAGGGTCTAGTCTGAGTGCGTGTGGGGACCTCAGAACACTGTGACCTTAGCCC 
AGCAAGCCAGGCCCTTCATGAAGGCCAAGAAGGCTGCCACCATTCCCTGCCAGCCCAAGAACT 
CCAGCTTCCCCACTGCCTCTGTGTGCCCCTTTGCGTCCTGTGAAGGCCATTGAGAAATGCCCA 
GTGTGCCCCCTGGGAAAGGGCACGGCCTGTGCTCCTGACACGGGCTGTGCTTGGCCACAGAAC 
CACCCAGCGTCTCCCCTGCTGCTGTCCACGTCAGTTCATGAGGCAACGTCGCGTGGTCTCAGA 
CGTGGAGCAGCCAGCGGCAGCTCAGAGCAGGGCACTGTGTCCGGCGGAGCCAAGTCCACTCTG
GGGGAGCTCTGGCGGGGACCACGGGCCACTGCTCACCCACTGGCCCCGAGGGGGGTGTAGACG 
CCAAGACTCACGCATGTGTGACATCCGGAGTCCTGGAGCCGGGTGTCCCAGTGGCACCACTAG 
GTGCCTGCTGCCTCCACAGTGGGGTTCACACCCAGGGCTCCTTGGTCCCCCACAACCTGCCCC 
GGCCAGGCCTGCAGACCCAGACTCCAGCCAGACCTGCCTCACCCACCAATGCAGCCGGGGCTG 
GCGACACCAGCCAGGTGCTGGTCTTGGGCCAGTTCTCCCACGACGGCTCACCCTCCCCTCCAT 
CTGCGTTGATGCTCAGAATCGCCTACCTGTGCCTGCGTGTAAACCACAGCCTCAGAGCAGCTA 
TGGGGAGAGGACAACACGGAGGATATCCAGCTTCCCCGGTCTGGGGTGAGGAATGTGGGGAGC 
TTGGGCATCCTCCTCCAGCCTCCTGCAGCCCCCAGGCAGTGCCTTACCTGTGGTGCCCAGAAA 
AGTGCCCCTAGGTTGGTGGGTCTACAGGAGCCTCAGCCAGGCAGCCCACCCCACCCTGGGGCC 
CTGCCTCACCAAGGAAATAAAGACTCAAGCCATAAAAAAAA 

FIGURE 484

MRPGAPGPLWPLPWGALAWAVGFVSSMGSGNPAPGGVCWLQQGQEATCSLVLQTDVTRAECCA 
SGNIDTAWSNLTHPGNKINLLGFLGLVHCLPCKDSCDGVECGPGKACRMLGGRPRCECAPDCS 
GLPARLQVCGSDGATYRDECELRAARCRGHPDLSVMYRGRCRKSCEHVVCPRPQSCVVDQTGS
AHCVVCRAAPCPVPSSPGQELCGNNNVTYISSCHMRQATCFLGRSIGVRHAGSCAGTPEEPPG 
GESAEEEENFV 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation sites. 

amino acids 73-77, 215-219

Osteonectin domain proteins. 

amino acids 97-130, 169-202 

FIGURE 485

GCTCGAGGCCGGCGGCGGCGGGAGAGCGACCCGGGCGGCCTCGTAGCGGGGCCCCGGATCCCC 
GAGTGGCGGCCGGAGCCTCGAAAAGAGATTCTCAGCGCTGATTTTGAGATGATGGGCTTGGGA
AACGGGCGTCGCAGCATGAAGTCGCCGCCCCTCGTGCTGGCCGCCCTGGTGGCCTGCATCATC 
GTCTTGGGCTTCAACTACTGGATTGCGAGCTCCCGGAGCGTGGACCTCCAGACACGGATCATG 
GAGCTGGAAGGCAGGGTCCGCAGGGCGGCTGCAGAGAGAGGCGCCGTGGAGCTGAAGAAGAAC 
GAGTTCCAGGGAGAGCTGGAGAAGCAGCGGGAGCAGCTTGACAAAATCCAGTCCAGCCACAAC 
TTCCAGCTGGAGAGCGTCAACAAGCTGTACCAGGACGAAAAGGCGGTTTTGGTGAATAACATC 
ACCACAGGTGAGAGGCTCATCCGAGTGCTGCAAGACCAGTTAAAGACCCTGCAGAGGAATTAC 
GGCAGGCTGCAGCAGGATGTCCTCCAGTTTCAGAAGAACCAGACCAACCTGGAGAGGAAGTTC 
TCCTACGACCTGAGCCAGTGCATCAATCAGATGAAGGAGGTGAAGGAACAGTGTGAGGAGCGA 
ATAGAAGAGGTCACCAAAAAGGGGAATGAAGCTGTAGCTTCCAGAGACCTGAGTGAAAACAAC 
GACCAGAGACAGCAGCTCCAAGCCCTCAGTGAGCCTCAGCCCAGGCTGCAGGCAGCAGGCCTG 
CCACACACAGAGGTGCCACAAGGGAAGGGAAACGTGCTTGGTAACAGCAAGTCCCAGACACCA 
GCCCCCAGTTCCGAAGTGGTTTTGGATTCAAAGAGACAAGTTGAGAAAGAGGAAACCAATGAG 
ATCCAGGTGGTGAATGAGGAGCCTCAGAGGGACAGGCTGCCGCAGGAGCCAGGCCGGGAGCAG 
GTGGTGGAAGACAGACCTGTAGGTGGAAGAGGCTTCGGGGGAGCCGGAGAACTGGGCCAGACC 
CCACAGGTGCAGGCTGCCCTGTCAGTGAGCCAGGAAAATCCAGAGATGGAGGGCCCTGAGCGA 
GACCAGCTTGTCATCCCCGACGGACAGGAGGAGGAGCAGGAAGCTGCCGGGGAAGGGAGAAAC 
CAGCAGAAACTGAGAGGAGAAGATGACTACAACATGGATGAAAATGAAGCAGAATCTGAGACA 
GACAAGCAAGCAGCCCTGGCAGGGAATGACAGAAACATAGATGTTTTTAATGTTGAAGATCAG 
AAAAGAGACACCATAAATTTACTTGATCAGCGTGAAAAGCGGAATCATACACTCT^ATTGAA 
CTGGAATCACATATTTCACAACAGGGCCGAAGAGATGACTATAAAATGTTCATGAGGGACTGA 
ATACTGAAAACTGTGAAATGTACTAAATAAAATGTACATCTGA 

FIGURE 486

MMGLGNGRRSMKSPPLVLAALVACIIVLGFNYWIASSRSVDLQTRIMELEGRVRRAAAERGAV 
ELKKNEFQGELEKQREQLDKIQSSHNFQLESVNKLYQDEKAVLVNNITTGERLIRVLQDQLKT 
LQRNYGRLQQDVLQFQKNQTNLERKFSYDLSQCINQMKEVKEQCEERIEEVTKKGNEAVASRD 
LSENNDQRQQLQALSEPQPRLQAAGLPHTEVPQGKGNVLGNSKSQTPAPSSEVVLDSKRQVEK
EETNEIQVVNEEPQRDRLPQEPGREQVVEDRPVGGRGFGGAGELGQTPQVQAALSVSQENPEM
EGPERDQLVIPDGQEEEQEAAGEGRNQQKLRGEDDYNMDENEAESETDKQAALAGNDRNIDVF 
NVEDQKRDTINLLDQREKRNHTL 

Important features:
Signal peptide: 

amino acids 1-29 

FIGURE 487

AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCTCCCGGAGTGGCCTTGGCAGGG 
TGTTGGAGCCCTCGGTCTGCCCCGTCCGGTCTCTGGGGCCAAGGCTGGGTTTCCCTCATGTAT
GGCAAGAGCTCTACTCGTGCGGTGCTTCTTCTCCTTGGCATACAGCTCACAGCTCTTTGGCCT 
ATAGCAGCTGTGGAAATTTATACCTCCCGGGTGCTGGAGGCTGTTAATGGGACAGATGCTCGG 
TTAAAATGCACTTTCTCCAGCTTTGCCCCTGTGGGTGATGCTCTAACAGTGACCTGGAATTTT 
CGTCCTCTAGACGGGGGACCTGAGCAGTTTGTATTCTACTACCACATAGATCCCTTCCAACCC 
ATGAGTGGGCGGTTTAAGGACCGGGTGTCTTGGGATGGGAATCCTGAGCGGTACGATGCCTCC 
ATCCTTCTCTGGAAACTGCAGTTCGACGACAATGGGACATACACCTGCCAGGTGAAGAACCCA 
CCTGATGTTGATGGGGTGATAGGGGAGATCCGGCTCAGCGTCGTGCACACTGTACGCTTCTCT 
GAGATCCACTTCCTGGCTCTGGCCATTGGCTCTGCCTGTGCACTGATGATCATAATAGTAATT 
GTAGTGGTCCTCTTCCAGCATTACCGGAAAAAGCGATGGGCCGAAAGAGCTCATAAAGTGGTG 
GAGATAAAATCAAAAGAAGAGGAAAGGCTCAACCAAGAGAAAAAGGTCTCTGTTTATTTAGAA 
GACACAGACTAACAATTTTAGATGGAAGCTGAGATGATTTCCAAGAACAAGAACCCTAGTATT 
TCTTGAAGTTAATGGAAACTTTTCTTTGGCTTTTCCAGTTGTGACCCGTTTTCCAACCAGTTC 
TGCAGCATATTAGATTCTAGACAAGCAACACCCCTCTGGAGCCAGCACAGTGCTCCTCCATAT 
CACCAGTCATACACAGCCTCATTATTAAGGTCTTATTTAATTTCAGAGTGTAAATTTTTTCAA 
GTGCTCATTAGGTTTTATAAACAAGAAGCTACATTTTTGCCCTTAAGACACTACTTACAGTGT 
TATGACTTGTATACACATATATTGGTATCAAAGGGGATAAAAGCCAATTTGTCTGTTACATTT 
CCTTTCACGTATTTCTTTTAGCAGCACTTCTGCTACTAAAGTTAATGTGTTTACTCTCTTTCC 
TTCCCACATTCTCAATTAAAAGGTGAGCTAAGCCTCCTCGGTGTTTCTGATTAACAGTAAATC 
CTAAATTCAAACTGTTAAATGACATTTTTATTTTTATGTCTCTCCTTAACTATGAGACACATC 
TTGTTTTACTGAATTTCTTTCAATATTCCAGGTGATAGATTTTTGTCG 

FIGURE 488

MYGKSSTRAVLLLLGIQLTALWPIAAVEIYTSRVLEAVNGTDARLKCTFSSFAPVGDALTVTW
NFRPLDGGPEQFVFYYHIDPFQPMSGRFKDRVSWDGNPERYDASILLWKLQFDDNGTYTCQVK 
NPPDVDGVIGEIRLSVVHTVRFSEIHFLALAIGSACALMIIIVIVVVLFQHYRKKRWAERAHK
VVEIKSKEEERLNQEKKVSVYLEDTD

FIGURE 489

AAGCAACCAAACTGCAAGCTTTGGGAGTTGTTCGCTGTCCCTGCCCTGCTCTGCTAGGGAGAG
AACGCCAGAGGGAGGCGGCTGGCCCGGCGGCAGGCTCTCAGAACCGCTACCGGCGATGCTACT 
GCTGTGGGTGTCGGTGGTCGCAGCCTTGGCGCTGGCGGTACTGGCCCCCGGAGCAGGGGAGCA 
GAGGCGGAGAGCAGCCAAAGCGCCCAATGTGGTGCTGGTCGTGAGCGACTCCTTCGATGGAAG 
GTTAACATTTCATCCAGGAAGTCAGGTAGTGAAACTTCCTTTTATCAACTTTATGAAGACACG 
TGGGACTTCCTTTCTGAATGCCTACACAAACTCTCCAATTTGTTGCCCATCACGCGCAGCAAT
GTGGAGTGGCCTCTTCACTCACTTAACAGAATCTTGGAATAATTTTAAGGGTCTAGATCCAAA 
TTATACAACATGGATGGATGTCATGGAGAGGCATGGCTACCGAACACAGAAATTTGGGAAACT 
GGACTATACTTCAGGACATCACTCCATTAGTAATCGTGTGGAAGCGTGGACAAGAGATGTTGC
TTTCTTACTCAGACAAGAAGGCAGGCCCATGGTTAATCTTATCCGTAACAGGACTAAAGTCAG 
AGTGATGGAAAGGGATTGGCAGAATACAGACAAAGCAGTAAACTGGTTAAGAAAGGAAGCAAT 
TAATTACACTGAACCATTTGTTATTTACTTGGGATTAAATTTACCACACCCTTACCCTTCACC 
ATCTTCTGGAGAAAATTTTGGATCTTCAACATTTCACACATCTCTTTATTGGCTTGAAAAAGT 
GTCTCATGATGCCATCAAAATCCCAAAGTGGTCACCTTTGTCAGAAATGCACCCTGTAGATTA 
TTACTCTTCTTATACAAAAAACTGCACTGGAAGATTTACAAAAAAAGAAATTAAGAATATTAG
AGCATTTTATTATGCTATGTGTGCTGAGACAGATGCCATGCTTGGTGAAATTATTTTGGCCCT 
TCATCAATTAGATCTTCTTCAGAAAACTATTGTCATATACTCCTCAGACCATGGAGAGCTGGC 
CATGGAACATCGACAGTTTTATAAAATGAGCATGTACGAGGCTAGTGCACATGTTCCGCTTTT 
GATGATGGGACCAGGAATTAAAGCCGGCCTACAAGTATCAAATGTGGTTTCTCTTGTGGATAT 
TTACCCTACCATGCTTGATATTGCTGGAATTCCTCTGCCTCAGAACCTGAGTGGATACTCTTT 
GTTGCCGTTATCATCAGAAACATTTAAGAATGAACATAAAGTCAAAAACCTGCATCCACCCTG 
GATTCTGAGTGAATTCCATGGATGTAATGTGAATGCCTCCACCTACATGCTTCGAACTAACCA 
CTGGAAATATATAGCCTATTCGGATGGTGCATCAATATTGCCTCAACTCTTTGATCTTTCCTC 
GGATCCAGATGAATTAACAAATGTTGCTGTAAAATTTCCAGAAATTACTTATTCTTTGGATCA 
GAAGCTTCATTCCATTATAAACTACCCTAAAGTTTCTGCTTCTGTCCACCAGTATAATAAAGA
GCAGTTTATCAAGTGGAAACAAAGTATAGGACAGAATTATTCAAACGTTATAGCAAATCTTAG 
GTGGCACCAAGACTGGCAGAAGGAACCAAGGAAGTATGAAAATGCAATTGATCAGTGGCTTAA 
AACCCATATGAATCCAAGAGCAGTTTGAACAAAAAGTTTAAAAATAGTGTTCTAGAGATACAT 
ATAAATATATTACAAGATCATAATTATGTATTTTAAATGAAACAGTTTTAATAATTACCAAGT 
TTTGGCCGGGCACAGTGGCTCACACCTGTAATCCCAGGACTTTGGGAGGCTGAGGAAAGCAGA 
TCACAAGGTCAAGAGATTGAGACCATCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAA 
TACAAAAATTAGCTGGGCGCGGTGGTGCACACCTATAGTCTCAGCTACTCAGAGGCTGAGGCA 
GGAGGATCGCTTGAACCCGGGAGGCAGCAGTTGCAGTGAGCTGAGATTGCGCCACTGTACTCC 
AGCCTGGCAACAGAGTGAGACTGTGTCGCAAAAAAATAAAAATAAAATAATAATAATTACCAA 
TTTTTCATTATTTTGTAAGAATGTAGTGTATTTTAAGATAAAATGCCAATGATTATAAAATCA 
GATATTTTCAAAAATGGTTATTATTTAGGCCTTTGTACAATTTCTAACAATTTAGTGGAAGTA 
TCAAAAGGATTGAAGCAAATACTGTAACAGTTATGTTCCTTTAAATAATAGAGAATATAAAAT 
ATTGTAATAATATGTATCATAAAATAGTTGTATGTGAGCATTTGATGGTGAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAA 

FIGURE 490

MLLLWVSVVAALALAVLAPGAGEQRRRAAKAPNVVLVVSDSFDGRLTFHPGSQVVKLPFINFM
KTRGTSFLNAYTNSPICCPSRAAMWSGLFTHLTESWNNFKGLDPNYTTWMDVMERHGYRTQKF 
GKLDYTSGHHSISNRVEAWTRDVAFLLRQEGRPMVNLIRNRTKVRVMERDWQNTDKAVNWLRK 
EAINYTEPFVIYLGLNLPHPYPSPSSGENFGSSTFHTSLYWLEKVSHDAIKIPKWSPLSEMHP
VDYYSSYTKNCTGRFTKKEIKNIRAFYYAMCAETDAMLGEIILALHQLDLLQKTIVIYSSDHG 
ELAMEHRQFYKMSMYEASAHVPLLMMGPGIKAGLQVSNVVSLVDIYPTMLDIAGIPLPQNLSG 
YSLLPLSSETFKNEHKVKNLHPPWILSEFHGCNVNASTYMLRTNHWKYIAYSDGASILPQLFD 
LSSDPDELTNVAVKFPEITYSLDQKLHSIINYPKVSASVHQYNKEQFIKWKQSIGQNYSNVIA 
NLRWHQDWQKEPRKYENAIDQWLKTHMNPRAV 

Important features:
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-111, 166-169, 193-196, 262-265, 375-378, 413-416, 
498-501 

Sulfatases proteins: 

amino acids 286-315, 359-369, 78-97 

FIGURE 491

GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCAAGA 
GCTTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGC 
CTCTCTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGT 
TGCCATGCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGT 
TGGCTTCTCCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGA 
CATCTATAGCACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGAC 
ATCCAGTGCAATCTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTT 
CTGCCAGGAATCCCGAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGG 
AGGCCTCCTGGGATTCATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTC 
ACCACTGGTGCCTGACAGCATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTC 
TTCCCTGTTCTCCCTGATAGCTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCG 
CTCCAACTACTACGATGCCTACCAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGG 
TCAACCTCCCAAAGTCAAGAGTGAGTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAAC 
CAGGGGCCAGAGCTGGGGGGTGGCTGGGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCA 
CAGGTGAGGGACACTACCACTGGATCGTGTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGG 
CCATTGGATTGAGCAAAGGCAGAAATGGGGGCTAGTGTAACAGCATGCAGGTTGAATTGCCAA 
GGATGCTCGCCATGCCAGCCTTTCTGTTTTCCTCACCTTGCTGCTCCCCTGCCCTAAGTCCCC 
AACCCTCAACTTGAAACCCCATTCCCTTAAGCCAGGACTCAGAGGATCCCTTTGCCCTCTGQT 
TTACCTGGGACTCCATCCCCAAACCCACTAATCACATCCCACTGACTGACCCTCTGTGATCAA 
AGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATTGCTGGGGATGGGAAGGAGAAGCAGT 
GGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTCCCTCCAAAGAAACTGATTGGCCC 
TGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCAGACTAATTTGTGCATGAACTG 
AAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATGCAGGATGGGAGGACAGGAA 
GGCAGCCTGGGACATTTAAAAAAATA 

FIGURE 492

MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQ 
CDIYSTLLGLPADIQAAQAMMVTSSAISSLACIISVVGMRCTVFCQESRAKDRVAVAGGVFFI 
LGGLLGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQR 
NRSNYYDAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59
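
The feature coordinates listed for Figure 492 can be cross-checked against the printed polypeptide. The sketch below is illustrative only and is not part of the original figure. It assumes that the N-glycosylation annotation follows the common PROSITE-style consensus N-{P}-[ST]-{P} and that positions are 1-based and inclusive. The names `FIG_492`, `n_glycosylation_sites` and `residues` are hypothetical helpers introduced here.

```python
import re

# Figure 492 polypeptide, joined from the four lines printed above.
FIG_492 = (
    "MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQ"
    "CDIYSTLLGLPADIQAAQAMMVTSSAISSLACIISVVGMRCTVFCQESRAKDRVAVAGGVFFI"
    "LGGLLGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQR"
    "NRSNYYDAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV"
)

# Assumed consensus: N, any residue except P, then S or T, then any residue
# except P. The lookahead allows overlapping candidate sites to be reported.
N_GLYC = re.compile(r"(?=(N[^P][ST][^P]))")


def n_glycosylation_sites(seq):
    """Return (start, end, motif) tuples with 1-based inclusive positions."""
    return [(m.start() + 1, m.start() + 4, m.group(1))
            for m in N_GLYC.finditer(seq)]


def residues(seq, start, end):
    """Return the residues from start to end (1-based, inclusive)."""
    return seq[start - 1:end]


if __name__ == "__main__":
    print(n_glycosylation_sites(FIG_492))  # candidate N-glycosylation sites
    print(residues(FIG_492, 1, 24))        # annotated signal peptide
```

Under these assumptions the scan reports a single site, NRSN at residues 190-193, which agrees with the N-glycosylation site listed above.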



FIGURE 493 

GCACTGCTGCTGTCCCATCAGCTGCTCTGAAGCTCCATGGTGCCCAGAATCTTCGCTCCTGCT
TATGTGTCAGTCTGTCTCCTCCTCTTGTGTCCAAGGGAAGTCATCGCTCCCGCTGGCTCAGAA 
CCATGGCTGTGCCAGCCGGCACCCAGGTGTGGAGACAAGATCTACAACCCCTTGGAGCAGTGC 
TGTTACAATGACGCCATCGTGTCCCTGAGCGAGACCCGCCAATGTGGTCCCCCCTGCACCTTC 
TGGCCCTGCTTTGAGCTCTGCTGTCTTGATTCCTTTGGCCTCACAAACGATTTTGTTGTGAAG 
CTGAAGGTTCAGGGTGTGAATTCCCAGTGCCACTCATCTCCCATCTCCAGTAAATGTGAAAGC 
AGAAGACGTTTTCCCTGAGAAGACATAGAAAGAAAATCAACTTTCACTAAGGCATCTCAGAAA 
CATAGGCTAAGGTAATATGTGTACCAGTAGAGAAGCCTGAGGAATTTACAAAATGATGCAGCT 
CCAAGCCATTGTATGGCCCATGTGGGAGACTGATGGGACATGGAGAATGACAGTAGATTATCA 
GGAAATAAATAAAGTGGTTTTTCCAATGTACACACCTGTAAAA 

FIGURE 494

MVPRIFAPAYVSVCLLLLCPREVIAPAGSEPWLCQPAPRCGDKIYNPLEQCCYNDAIVSLSET 
RQCGPPCTFWPCFELCCLDSFGLTNDFVVKLKVQGVNSQCHSSPISSKCESRRRFP

Important features: 
Signal peptide: 

amino acids 1-25 

FIGURE 495

CTCCACTGCAACCACCCAGAGCCATGGCTCCCCGAGGCTGCATCGTAGCTGTCTTTGCCATTT 
TCTGCATCTCCAGGCTCCTCTGCTCACACGGAGCCCCAGTGGCCCCCATGACTCCTTACCTGA 
TGCTGTGCCAGCCACACAAGAGATGTGGGGACAAGTTCTACGACCCCCTGCAGCACTGTTGCT 
ATGATGATGCCGTCGTGCCCTTGGCCAGGACCCAGACGTGTGGAAACTGCACCTTCAGAGTCT 
GCTTTGAGCAGTGCTGCCCCTGGACCTTCATGGTGAAGCTGATAAACCAGAACTGCGACTCAG 
CCCGGACCTCGGATGACAGGCTTTGTCGCAGTGTCAGCTAATGGAACATCAGGGGAACGATGA 
CTCCTGGATTCTCCTTCCTGGGTGGGCCTGGAGAAAGAGGCTGGTGTTACCTGAGATCTGGGA 
TGCTGAGTGGCTGTTTGGGGGCCAGAGAAACACACACTCAACTGCCCACTTCATTCTGTGACC 
TGTCTGAGGCCCACCCTGCAGCTGCCCTGAGGAGGCCCACAGGTCCCCTTCTAGAATTCTGGA
CAGCATGAGATGCGTGTGCTGATGGGGGCCCAGGGACTCTGAACCCTCCTGATGACCCCTATG 
GCCAACATCAACCCGGCACCACCCCAAGGCTGGCTGGGGAACCCTTCACCCTTCTGTGAGATT 
TTCCATCATCTCAAGTTCTCTTCTATCCAGGAGCAAAGCACAGGATCATAATAAATTTATGTA 
CTTTATAAATGAAAA 

FIGURE 496

MAPRGCIVAVFAIFCISRLLCSHGAPVAPMTPYLMLCQPHKRCGDKFYDPLQHCCYDDAVVPL
ARTQTCGNCTFRVCFEQCCPWTFMVKLINQNCDSARTSDDRLCRSVS 

Important features:
Signal peptide: 

amino acids 1-24 

FIGURE 497

TGAAGGACTTTTCCAGGACCCAAGGCCACACACTGGAAGTCTTGCAGCTGAAGGGAGGCACTC 
CTTGGCCTCCGCAGCCGATCACATGAAGGTGGTGCCAAGTCTCCTGCTCTCCGTCCTCCTGGC 
ACAGGTGTGGCTGGTACCCGGCTTGGCCCCCAGTCCTCAGTCGCCAGAGACCCCAGCCCCTCA 
GAACCAGACCAGCAGGGTAGTGCAGGCTCCCAGGGAGGAAGAGGAAGATGAGCAGGAGGCCAG 
CGAGGAGAAGGCCGGTGAGGAAGAGAAAGCCTGGCTGATGGCCAGCAGGCAGCAGCTTGCCAA 
GGAGACTTCAAACTTCGGATTCAGCCTGCTGCGAAAGATCTCCATGAGGCACGATGGCAACAT 
GGTCTTCTCTCCATTTGGCATGTCCTTGGCCATGACAGGCTTGATGCTGGGGGCCACAGGGCC 
GACTGAAACCCAGATCAAGAGAGGGCTCCACTTGCAGGCCCTGAAGCCCACCAAGCCCGGGCT 
CCTGCCTTCCCTCTTTAAGGGACTCAGAGAGACCCTCTCCCGCAACCTGGAACTGGGCCTCTC 
ACAGGGGAGTTTTGCCTTCATCCACAAGGATTTTGATGTCAAAGAGACTTTCTTCAATTTATC 
CAAGAGGTATTTTGATACAGAGTGCGTGCCTATGAATTTTCGCAATGCCTCACAGGCCAAAAG 
GCTCATGAATCATTACATTAACAAAGAGACTCGGGGGAAAATTCCCAAACTGTTTGATGAGAT 
TAATCCTGAAACCAAATTAATTCTTGTGGATTACATCTTGTTCAAAGGGAAATGGTTGACCCC 
ATTTGACCCTGTCTTCACCGAAGTCGACACTTTCCACCTGGACAAGTACAAGACCATTAAGGT 
GCCCATGATGTACGGTGCAGGCAAGTTTGCCTCCACCTTTGACAAGAATTTTCGTTGTCATGT 
CCTCAAACTGCCCTACCAAGGAAATGCCACCATGCTGGTGGTCCTCATGGAGAAAATGGGTGA 
CCACCTCGCCCTTGAAGACTACCTGACCACAGACTTGGTGGAGACATGGCTCAGAAACATGAA 
AACCAGAAACATGGAAGTTTTCTTTCCGAAGTTCAAGCTAGATCAGAAGTATGAGATGCATGA 
GCTGCTTAGGCAGATGGGAATCAGAAGAATCTTCTCACCCTTTGCTGACCTTAGTGAACTCTC 
AGCTACTGGAAGAAATCTCCAAGTATCCAGGGTTTTACGAAGAACAGTGATTGAAGTTGATGA 
AAGGGGCACTGAGGCAGTGGCAGGAATCTTGTCAGAAATTACTGCTTATTCCATGCCTCCTGT 
CATCAAAGTGGACCGGCCATTTCATTTCATGATCTATGAAGAAACCTCTGGAATGCTTCTGTT 
TCTGGGCAGGGTGGTGAATCCGACTCTCCTATAATTCAGGACATGCATAAGCACTTCGTGCTG 
TAGTAGATGCTGAATCTGAGGTATCAAACACACACAGGATACCAGCAATGGATGGCAGGGGAG 
AGTGTTCCTTTTGTTCTTAACTAGTTTAGGGTGTTCTCAAATAAATACAGTAGTCCCCACTTA 
TCTGAGGGGGATACATTCAAAGACCCCCAGCAGATGCCTGAAACGGTGGACAGTGCTGAACCT 
TATATATATTTTTTCCTACACATACATAGCTATGATAAAGTTTAATTTATAAATTAGGCACAG 
TAAGAGATTAACAATAATAACAACATTAAGTAAAATGAGTTACTTGAACGCAAGCACTGCAAT 
ACCATAACAGTCAAACTGATTATAGAGAAGGCTACTAAGTGACTCATGGGCGAGGAGCATAG^ 
CAGTGTGGAGACATTGGGCAAGGGGAGAATTCACATCCTGGGTGGGACAGAGCAGGACGATGC 
AAGATTCCATCCCACTACTCAGAATGGCATGCTGCTTAAGACTTTTAGATTGTTTATTTCTGG 
AATTTTTCATTTAATGTTTTTGGACCATGGTTGACCATGGTTAACTGAGACTGCAGAAAGCAA 
AACCATGGATAAGGGAGGACTACTACAAAAGCATTAAATTGATACATATTTTTTAAAAAAAAA 
AAAAAAAAAA 

FIGURE 498

MKVVPSLLLSVLLAQVWLVPGLAPSPQSPETPAPQNQTSRVVQAPREEEEDEQEASEEKAGEE 
EKAWLMASRQQLAKETSNFGFSLLRKISMRHDGNMVFSPFGMSLAMTGLMLGATGPTETQIKR 
GLHLQALKPTKPGLLPSLFKGLRETLSRNLELGLSQGSFAFIHKDFDVKETFFNLSKRYFDTE 
CVPMNFRNASQAKRLMNHYINKETRGKIPKLFDEINPETKLILVDYILFKGKWLTPFDPVFTE 
VDTFHLDKYKTIKVPMMYGAGKFASTFDKNFRCHVLKLPYQGNATMLVVLMEKMGDHLALEDY 
LTTDLVETWLRNMKTRNMEVFFPKFKLDQKYEMHELLRQMGIRRIFSPFADLSELSATGRNLQ 
VSRVLRRTVIEVDERGTEAVAGILSEITAYSMPPVIKVDRPFHFMIYEETSGMLLFLGRVVNP
TLL 

FIGURE 499

CTAGCCTGCGCCAAGGGGTAGTGAGACCGCGCGGCAACAGCTTGCGGCTGCGGGGAGCTCCCG 
TGGGCGCTCCGCTGGCTGTGCAGGCGGCCATGGATTCCTTGCGGAAAATGCTGATCTCAGTCG 
CAATGCTGGGCGCAGGGGCTGGCGTGGGCTACGCGCTCCTCGTTATCGTGACCCCGGGAGAGC 
GGCGGAAGCAGGAAATGCTAAAGGAGATGCCACTGCAGGACCCAAGGAGCAGGGAGGAGGCGG 
CCAGGACCCAGCAGCTATTGCTGGCCACTCTGCAGGAGGCAGCGACCACGCAGGAGAACGTGG 
CCTGGAGGAAGAACTGGATGGTTGGCGGCGAAGGCGGCGCCAGCGGGAGGTCACCGTGAGACC 
GGACTTGCCTCCGTGGGCGCCGGACCTTGGCTTGGGCGCAGGAATCCGAGGCAGCCTTTCTCC 
TTCGTGGGCCCAGCGGAGAGTCCGGACCGAGATACCATGCCAGGACTCTCCGGGGTCCTGTGA 
GCTGCCGTCGGGTGAGCACGTTTCCCCCAAACCCTGGACTGACTGCTTTAAGGTCCGCAAGGC 
GGGCCAGGGCCGAGACGCGAGTCGGATGTGGTGAACTGAAAGAACCAATAAAATCATGTTCCT 
CCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

FIGURE 500

MDSLRKMLISVAMLGAGAGVGYALLVIVTPGERRKQEMLKEMPLQDPRSREEAARTQQLLLAT 
LQEAATTQENVAWRKNWMVGGEGGASGRSP 

Important features: 
Signal peptide: 

amino acids 1-18

FIGURE 501

CAGGAGAGAAGGCACCGCCCCCACCCCGCCTCCAAAGCTAACCCTCGGGCTTGAGGGGAAGAG 
GCTGACTGTACGTTCCTTCTACTCTGGCACCACTCTCCAGGCTGCCATGGGGCCCAGCACCCC 
TCTCCTCATCTTGTTCCTTTTGTCATGGTCGGGACCCCTCCAAGGACAGCAGCACCACCTTGT 
GGAGTACATGGAACGCCGACTAGCTGCTTTAGAGGAACGGCTGGCCCAGTGCGAGGACCAGAG 
TAGTCGGCATGCTGCTGAGCTGCGGGACTTCAAGAACAAGATGCTGCCACTGCTGGAGGTGGC 
AGAGAAGGAGCGGGAGGCACTCAGAACTGAGGCCGACACCATCTCCGGGAGAGTGGATCGTCT 
GGAGCGGGAGGTAGACTATCTGGAGACCCAGAACCCAGCTCTGCCCTGTGTAGAGTTTGATGA 
GAAGGTGACTGGAGGCCCTGGGACCAAAGGCAAGGGAAGAAGGAATGAGAAGTACGATATGGT 
GACAGACTGTGGCTACACAATCTCTCAAGTGAGATCAATGAAGATTCTGAAGCGATTTGGTGG 
CCCAGCTGGTCTATGGACCAAGGATCCACTGGGGCAAACAGAGAAGATCTACGTGTTAGATGG 
GACACAGAATGACACAGCCTTTGTCTTCCCAAGGCTGCGTGACTTCACCCTTGCCATGGCTGC 
CCGGAAAGCTTCCCGAGTCGGGGTGCCCTTCCCCTGGGTAGGCACAGGGCAGCTGGTATATGG 
TGGCTTTCTTTATTTTGCTCGGAGGCCTCCTGGAAGACCTGGTGGAGGTGGTGAGATGGAGAA 
CACTTTGCAGCTAATCAAATTCCACCTGGCAAACCGAACAGTGGTGGACAGCTCAGTATTCCC 
AGCAGAGGGGCTGATCCCCCCCTACGGCTTGACAGCAGACACCTACATCGACCTGGTAGCTGA 
TGAGGAAGGTCTTTGGGCTGTCTATGCCACCCGGGAGGATGACAGGCACTTGTGTCTGGCCAA 
GTTAGATCCACAGACACTGGACACAGAGCAGCAGTGGGACACACCATGTCCCAGAGAGAATGC 
TGAGGCTGCCTTTGTCATCTGTGGGACCCTCTATGTCGTCTATAACACCCGTCCTGCCAGTCG 
GGCCCGCATCCAGTGCTCCTTTGATGCCAGCGGCACCCTGACCCCTGAACGGGCAGCACTCCC 
TTATTTTCCCCGCAGATATGGTGCCCATGCCAGCCTCCGCTATAACCCCCGAGAACGCCAGCT 
CTATGCCTGGGATGATGGCTACCAGATTGTCTATAAGCTGGAGATGAGGAAGAAAGAGGAGGA 
GGTTTGAGGAGCTAGCCTTGTTTTTTGCATCTTTCTCACTCCCATACATTTATATTATATCCC 
CACTAAATTTCTTGTTCCTCATTCTTCAAATGTGGGCCAGTTGTGGCTCAAATCCTCTATATT 
TTTAGCCAATGGCAATCAAATTCTTTCAGCTCCTTTGTTTCATACGGAACTCCAGATCCTGAG 
TAATCCTTTTAGAGCCCGAAGAGTCAAAACCCTCAATGTTCCCTCCTGCTCTCCTGCCCCATG 
TCAACAAATTTCAGGCTAAGGATGCCCCAGACCCAGGGCTCTAACCTTGTATGCGGGCAGGCC 
CAGGGAGCAGGCAGCAGTGTTCTTCCCCTCAGAGTGACTTGGGGAGGGAGAAATAGGAGGAGA 
CGTCCAGCTCTGTCCTCTCTTCCTCACTCCTCCCTTCAGTGTCCTGAGGAACAGGACTTTCTC 
CACATTGTTTTGTATTGCAACATTTTGCATTAAAAGGAAAATCCACAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 502 

MGPSTPLLILFLLSWSGPLQGQQHHLVEYMERRLAALEERLAQCQDQSSRHAAELRDFKNKML 
PLLEVAEKEREALRTEADTISGRVDRLEREVDYLETQNPALPCVEFDEKVTGGPGTKGKGRRN 
EKYDMVTDCGYTISQVRSMKILKRFGGPAGLWTKDPLGQTEKIYVLDGTQNDTAFVFPRLRDF 
TLAMAARKASRVRVPFPWVGTGQLVYGGFLYFARRPPGRPGGGGEMENTLQLIKFHLANRTVV
DSSVFPAEGLIPPYGLTADTYIDLVADEEGLWAVYATREDDRHLCLAKLDPQTLDTEQQWDTP 
CPRENAEAAFVICGTLYVVYNTRPASRARIQCSFDASGTLTPERAALPYFPRRYGAHASLRYN
PRERQLYAWDDGYQIVYKLEMRKKEEEV 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites.

amino acids 177-180, 248-251 

FIGURE 503

TGCGGCGCAGTGTAGACCTGGGAGGATGGGCGGCCTGCTGCTGGCTGCTTTTCTGGCTTTGGT
CTCGGTGCCCAGGGCCCAGGCCGTGTGGTTGGGAAGACTGGACCCTGAGCAGCTTCTTGGGCC 
CTGGTACGTGCTTGCGGTGGCCTCCCGGGAAAAGGGCTTTGCCATGGAGAAGGACATGAAGAA
CGTCGTGGGGGTGGTGGTGACCCTCACTCCAGAAAACAACCTGCGGACGCTGTCCTCTCAGCA 
CGGGCTGGGAGGGTGTGACCAGAGTGTCATGGACCTGATAAAGCGAAACTCCGGATGGGTGTT 
TGAGAATCCCTCAATAGGCGTGCTGGAGCTCTGGGTGCTGGCCACCAACTTCAGAGACTATGC 
CATCATCTTCACTCAGCTGGAGTTCGGGGACGAGCCCTTCAACACCGTGGAGCTGTACAGTCT 
GACGGAGACAGCCAGCCAGGAGGCCATGGGGCTCTTCACCAAGTGGAGCAGGAGCCTGGGCTT 
CCTGTCACAGTAGCAGGCCCAGCTGCAGAAGGACCTCACCTGTGCTCACAAGATCCTTCTGTG 
AGTGCTGCGTCCCCAGTAGGGATGGCGCCCACAGGGTCCTGTGACCTCGGCCAGTGTCCACCC 
ACCTCGCTCAGCGGCTCCCGGGGCCCAGCACCAGCTCAGAATAAAGCGATTCCACAGCA 

FIGURE 504

MGGLLLAAFLALVSVPRAQAVWLGRLDPEQLLGPWYVLAVASREKGFAMEKDMKNVVGVVVTL
TPENNLRTLSSQHGLGGCDQSVMDLIKRNSGWVFENPSIGVLELWVLATNFRDYAIIFTQLEF 
GDEPFNTVELYSLTETASQEAMGLFTKWSRSLGFLSQ 

Important features: 
Signal peptide: 

amino acids 1-20 

FIGURE 505

GTTCCGCAGATGCAGAGGTTGAGGTGGCTGCGGGACTGGAAGTCATCGGGCAGAGGTCTCACA
GCAGCCAAGGAACCTGGGGCCCGCTCCTCCCCCCTCCAGGCCATGAGGATTCTGCAGTTAATC
CTGCTTGCTCTGGCAACAGGGCTTGTAGGGGGAGAGACCAGGATCATCAAGGGGTTCGAGTGC
AAGCCTCACTCCCAGCCCTGGCAGGCAGCCCTGTTCGAGAAGACGCGGCTACTCTGTGGGGCG
ACGCTCATCGCCCCCAGATGGCTCCTGACAGCAGCCCACTGCCTCAAGCCCCGCTACATAGTT
CACCTGGGGCAGCACAACCTCCAGAAGGAGGAGGGCTGTGAGCAGACCCGGACAGCCACTGAG
TCCTTCCCCCACCCCGGCTTCAACAACAGCCTCCCCAACAAAGACCACCGCAATGACATCATG
CTGGTGAAGATGGCATCGCCAGTCTCCATCACCTGGGCTGTGCGACCCCTCACCCTCTCCTCA
CGCTGTGTCACTGCTGGCACCAGCTGCCTCATTTCCGGCTGGGGCAGCACGTCCAGCCCCCAG
TTACGCCTGCCTCACACCTTGCGATGCGCCAACATCACCATCATTGAGCACCAGAAGTGTGAG
AACGCCTACCCCGGCAACATCACAGACACCATGGTGTGTGCCAGCGTGCAGGAAGGGGGCAAG
GACTCCTGCCAGGGTGACTCCGGGGGCCCTCTGGTCTGTAACCAGTCTCTTCAAGGCATTATC
TCCTGGGGCCAGGATCCGTGTGCGATCACCCGAAAGCCTGGTGTCTACACGAAAGTCTGCAAA
TATGTGGACTGGATCCAGGAGACGATGAAGAACAATTAGACTGGACCCACCCACCACAGCCCA
TCACCCTCCATTTCCACTTGGTGTTTGGTTCCTGTTCACTCTGTTAATAAGAAACCCTAAGCC
AAGACCCTCTACGAACATTCTTTGGGCCTCCTGGACTACAGGAGATGCTGTCACTTAATAATC
AACCTGGGGTTCGAAATCAGTGAGACCTGGATTCAAATTCTGCCTTGAAATATTGTGACTCTG
GGAATGACAACACCTGGTTTGTTCTCTGTTGTATCCCCAGCCCCAAAGACAGCTCCTGGCCAT
ATATCAAGGTTTCAATAAATATTTGCTAAATGAAAJ^^
AAAAAAA

FIGURE 506

MRILQLILLALATGLVGGETRIIKGFECKPHSQPWQAALFEKTRLLCGATLIAPRWLLTAAHC 
LKPRYIVHLGQHNLQKEEGCEQTRTATESFPHPGFNNSLPNKDHRNDIMLVKMASPVSITWAV 
RPLTLSSRCVTAGTSCLISGWGSTSSPQLRLPHTLRCANITIIEHQKCENAYPGNITDTMVCA 
SVQEGGKDSCQGDSGGPLVCNQSLQGIISWGQDPCAITRKPGVYTKVCKYVDWIQETMKNN 

Important features: 
Signal peptide: 

amino acids 1-18 

Serine proteases, trypsin family, histidine active site. 

amino acids 58-63 

N-glycosylation sites. 

amino acids 99-102, 165-168, 181-184, 210-213 

Glycosaminoglycan attachment site. 

amino acids 145-148 

Kringle domain proteins. 

amino acids 197-209, 47-64 

Serine proteases, trypsin family, histidine protein 

amino acids 199-209, 47-63, 220-243 

Apple domain proteins 

amino acids 222-249, 189-222 

FIGURE 507

CTGGGATCAGCCACTGCAGCTCCCTGAGCACTCTCTACAGAGACGCGGACCCCAGACATGAGG 
AGGCTCCTCCTGGTCACCAGCCTGGTGGTTGTGCTGCTGTGGGAGGCAGGTGCAGTCCCAGCA 
CCCAAGGTCCCTATCAAGATGCAAGTCAAACACTGGCCCTCAGAGCAGGACCCAGAGAAGGCC
TGGGGCGCCCGTGTGGTGGAGCCTCCGGAGAAGGACGACCAGCTGGTGGTGCTGTTCCCTGTC 
CAGAAGCCGAAACTCTTGACCACCGAGGAGAAGCCACGAGGTCAGGGCAGGGGCCCCATCCTT 
CCAGGCACCAAGGCCTGGATGGAGACCGAGGACACCCTGGGCCGTGTCCTGAGTCCCGAGCCC 
GACCATGACAGCCTGTACCACCCTCCGCCTGAGGAGGACCAGGGCGAGGAGAGGCCCCGGTTG 
TGGGTGATGCCAAATCACCAGGTGCTCCTGGGACCGGAGGAAGACCAAGACCACATCTACCAC 
CCCCAGTAGGGCTCCAGGGGCCATCACTGCCCCCGCCCTGTCCCAAGGCCCAGGCTGTTGGGA 
CTGGGACCCTCCCTACCCTGCCCCAGCTAGACAAATAAACCCCAGCAGGCAAAAAAAAAAAAA 
AAAAAA 

FIGURE 508

MRRLLLVTSLVVVLLWEAGAVPAPKVPIKMQVKHWPSEQDPEKAWGARVVEPPEKDDQLVVLF
PVQKPKLLTTEEKPRGQGRGPILPGTKAWMETEDTLGRVLSPEPDHDSLYHPPPEEDQGEERP
RLWVMPNHQVLLGPEEDQDHIYHPQ

FIGURE 509

GCGGAGCCGGCGCCGGCTGCGCAGAGGAGCCGCTCTCGCCGCCGCCACCTCGGCTGGGAGCCC 
ACGAGGCTGCCGCATCCTGCCCTCGGAACAATGGGACTCGGCGCGCGAGGTGCTTGGGCCGCG 
CTGCTCCTGGGGACGCTGCAGGTGCTAGCGCTGCTGGGGGCCGCCCATGAAAGCGCAGCCATG 
GCGGCATCTGCAAACATAGAGAATTCTGGGCTTCCACACAACTCCAGTGCTAACTCAACAGAG 
ACTCTCCAACATGTGCCTTCTGACCATACAAATGAAACTTCCAACAGTACTGTGAAACCACCA 
ACTTCAGTTGCCTCAGACTCCAGTAATACAACGGTCACCACCATGAAACCTACAGCGGCATCT 
AATACAACAACACCAGGGATGGTCTCAACAAATATGACTTCTACCACCTTAAAGTCTACACCC 
AAAACAACAAGTGTTTCACAGAACACATCTCAGATATCAACATCCACAATGACCGTAACCCAC 
AATAGTTCAGTGACATCTGCTGCTTCATCAGTAACAATCACAACAACTATGCATTCTGAAGCA 
AAGAAAGGATCAAAATTTGATACTGGGAGCTTTGTTGGTGGTATTGTATTAACGCTGGGAGTT 
TTATCTATTCTTTACATTGGATGCAAAATGTATTACTCAAGAAGAGGCATTCGGTATCGAACC 
ATAGATGAACATGATGCCATCATTTAAGGAAATCCATGGACCAAGGATGGAATACAGATTGAT 
GCTGCCCTATCAATTAATTTTGGTTTATTAATAGTTTAAAACAATATTCTCTTTTTGAAAATA 
GTATAAACAGGCCATGCATATAATGTACAGTGTATTACGTAAATATGTAAAGATTCTTCAAGG 
TAACAAGGGTTTGGGTTTTGAAATAAACATCTGGATCTTATAGACCGTTCATACAATGGTTTT 
AGCAAGTTCATAGTAAGACAAACAAGTCCTATCTTTTTTTTTTGGCTGGGGTGGGGGCATTGG 
TCACATATGACCAGTAATTGAAAGACGTCATCACTGAAAGACAGAATGCCATCTGGGCATACA 
AATAAGAAGTTTGTCACAGCACTCAGGATTTTGGGTATCTTTTGTAGCTCACATAAAGAACTT 
CAGTGCTTTTCAGAGCTGGATATATCTTAATTACTAATGCCACACAGAAATTATACAATCAAA 
CTAGATCTGAAGCATAATTTAAGAAAAACATCAACATTTTTTGTGCTTTAAACTGTAGTAGTT 
GGTCTAGAAACAAAATACTCC 

FIGURE 510

MGLGARGAWAALLLGTLQVLALLGAAHESAAMAASANIENSGLPHNSSANSTETLQHVPSDHT 
NETSNSTVKPPTSVASDSSNTTVTTMKPTAASNTTTPGMVSTNMTSTTLKSTPKTTSVSQNTS 
QISTSTMTVTHNSSVTSAASSVTITTTMHSEAKKGSKFDTGSFVGGIVLTLGVLSILYIGCKM
YYSRRGIRYRTIDEHDAII

FIGURE 511

GACTTTGCTTGAATGTTTACATTTTCTGCTCGCTGTCCTACATATCACAATATAGTGTTCACGTTTTGTTAAAAC 
TTTGGGGTGTGAGGAGTTGAGCTTGCTCAGCAAGCCAGCATGGCTAGGATGAGCTTTGTTATAGCAGCTTGCCAA 
TTGGTGCTGGGCCTACTAATGACTTCATTAACCGAGTCTTCCATACAGAATAGTGAGTGTCCACAACTTTGCGTA 
TGTGAAATTCGTCCCTGGTTTACCCCACAGTCAACTTACAGAGAAGCCACCACTGTTGATTGCAATGACCTCCGC 
TTAACAAGGATTCCCAGTAACCTCTCTAGTGACACACAAGTGCTTCTCTTACAGAGCAATAACATCGCGAAGACT 
GTGGATGAGCTGCAGCAGCTTTTCAACTTGACTGAACTAGATTTCTCCCAAAACAACTTTACTAACATTAAGGAG 
GTCGGGCTGGCAAACCTAACCCAGCTCACAACGCTGCATTTGGAGGAAAATCAGATTACCGAGATGACTGATTAC 
TGTCTACAAGACCTCAGCAACCTTCAAGAACTCTACATCAACCACAACCAAATTAGCACTATTTCTGCTCATGCT 
TTTGCAGGCTTAAAAAATCTATTAAGGCTCCACCTGAACTCCAACAAATTGAAAGTTATTGATAGTCGCTGGTTT 
GATTCTACACCCAACCTGGAAATTCTCATGATCGGAGAAAACCCTGTGATTGGAATTCTGGATATGAACTTCAAA 
CCCCTCGCAAATTTGAGAAGCTTAGTTTTGGCAGGAATGTATCTCACTGATATTCCTGGAAATGCTTTGGTGGGT 
CTGGATAGCCTTGAGAGCCTGTCTTTTTATGATAACAAACTGGTTAAAGTCCCTCAACTTGCCCTGCAAAAAGTT 
CCAAATTTGAAATTCTTAGACCTCAACAAAAACCCCATTCACAAAATCCAAGAAGGGGACTTCAAAAATATGCTT 
CGGTTAAAAGAACTGGGAATCAACAATATGGGCGAGCTCGTTTCTGTCGACCGCTATGCCCTGGATAACTTGCCT 
GAACTCACAAAGCTGGAAGCCACCAATAACCCTAAACTCTCTTACATCCACCGCTTGGCTTTCCGAAGTGTCCCT 
GCTCTGGAAAGCTTGATGCTGAACAACAATGCCTTGAATGCCATTTACCAAAAGACAGTCGAATCCCTCCCCAAT 
CTGCGTGAGATCAGTATCCATAGCAATCCCCTCAGGTGTGACTGTGTGATCCACTGGATTAACTCCAACAAAACC 
AACATCCGCTTCATGGAGCCCCTGTCCATGTTCTGTGCCATGCCGCCCGAATATAAAGGGCACCAGGTGAAGGAA 
GTTTTAATCCAGGATTCGAGTGAACAGTGCCTCCCAATGATATCTCACGACAGCTTCCCAAATCGTTTAAACGTG 
GATATCGGCACGACGGTTTTCCTAGACTGTCGAGCCATGGCTGAGCCAGAACCTGAAATTTACTGGGTCACTCCC 
ATTGGAAATAAGATAACTGTGGAAACCCTTTCAGATAAATACAAGCTAAGTAGCGAAGGTACCTTGGAAATATCT 
AACATACAAATTGAAGACTCAGGAAGATACACATGTGTTGCCCAGAATGTCCAAGGGGCAGACACTCGGGTGGCA 
ACAATTAAGGTTAACGGGACCCTTCTGGATGGTACCCAGGTGCTAAAAATATACGTCAAGCAGACAGAATCCCAT 
TCCATCTTAGTGTCCTGGAAAGTTAATTCCAATGTCATGACGTCAAACTTAAAATGGTCGTCTGCCACCATGAAG 
ATTGATAACCCTCACATAACATATACTGCCAGGGTCCCAGTCGATGTCCATGAATACAACCTAACGCATCTGCAG 
CCTTCCACAGATTATGAAGTGTGTCTCACAGTGTCCAATATTCATCAGCAGACTCAAAAGTCATGCGTAAATGTC 
ACAACCAAAAATGCCGCCTTCGCAGTGGACATCTCTGATCAAGAAACCAGTACAGCCCTTGCTGCAGTAATGGGG 
TCTATGTTTGCCGTCATTAGCCTTGCGTCCATTGCTGTGTACTTTGCCAAAAGATTTAAGAGAAAAAACTACCAC 
CACTCATTAAAAAAGTATATGCAAAAAACCTCTTCAATCCCACTAAATGAGCTGTACCCACCACTCATTAACCTC 
TGGGAAGGTGACAGCGAGAAAGACAAAGATGGTTCTGCAGACACCAAGCCAACCCAGGTCGACACATCCAGAAGC 
TATTACATGTGG TAA CTCAGAGGATATTTTGCTTCTGGTAGTAAGGAGCACAAAGACGTTTTTGCTTTATTCTGC 
AAAAGTGAACAAGTTGAAGACTTTTGTATTTTTGACTTTGCTAGTTTGTGGCAGAGTGGAGAGGACGGGTGGATA 
TTTCAAATTTTTTTAGTATAGCGTATCGCAAGGGTTTGACACGGCTGCCAGCGACTCTAGGCTTCCAGTCTGTGT 
TTGGTTTTTATTCTTATCATTATTATGATTGTTATTATATTATTATTTTATTTTAGTTGTTGTGCTAAACTCAAT 
AATGCTGTTCTAACTACAGTGCTCAATAAAATGATTAATGACAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 512 

MARMSFVIAACQLVLGLLMTSLTESSIQNSECPQLCVCEIRPWFTPQSTYREATTVDCNDLRL 
TRIPSNLSSDTQVLLLQSNNIAKTVDELQQLFNLTELDFSQNNFTNIKEVGLANLTQLTTLHL 
EENQITEMTDYCLQDLSNLQELYINHNQISTISAHAFAGLKNLLRLHLNSNKLKVIDSRWFDS 
TPNLEILMIGENPVIGILDMNFKPLANLRSLVLAGMYLTDIPGNALVGLDSLESLSFYDNKLV 
KVPQLALQKVPNLKFLDLNKNPIHKIQEGDFKNMLRLKELGINNMGELVSVDRYALDNLPELT 
KLEATNNPKLSYIHRLAFRSVPALESLMLNNNALNAIYQKTVESLPNLREISIHSNPLRCDCV 
IHWINSNKTNIRFMEPLSMFCAMPPEYKGHQVKEVLIQDSSEQCLPMISHDSFPNRLNVDIGT 
TVFLDCRAMAEPEPEIYWVTPIGNKITVETLSDKYKLSSEGTLEISNIQIEDSGRYTCVAQNV 
QGADTRVATIKVNGTLLDGTQVLKIYVKQTESHSILVSWKVNSNVMTSNLKWSSATMKIDNPH 
ITYTARVPVDVHEYNLTHLQPSTDYEVCLTVSNIHQQTQKSCVNVTTKNAAFAVDISDQETST 
ALAAVMGSMFAVISLASIAVYFAKRFKRKNYHHSLKKYMQKTSSIPLNELYPPLINLWEGDSE 
KDKDGSADTKPTQVDTSRSYYMW 

Important features: 
Signal peptide: 

Amino acids 1-25 

Transmembrane domain: 

Amino acids 508-530 

N-glycosylation sites: 

Amino acids 69-73; 96-100; 106-110; 117-121; 385-389; 517-521;
582-586; 611-615

Tyrosine kinase phosphorylation site: 

Amino acids 573-582 

N-myristoylation sites: 

Amino acids 16-22; 224-230; 464-470; 637-643; 698-704 
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FIGURE 513 

GGGAGAGAGGATAAATAGCAGCGTGGCTTCCCTGGCTCCTCTCTGCATCCTTCCCGACCTTCC 
CAGCAATATGCATCTTGCACGTCTGGTCGGCTCCTGCTCCCTCCTTCTGCTACTGGGGGCCCT 
GTCTGGATGGGCGGCCAGCGATGACCCCATTGAGAAGGTCATTGAAGGGATCAACCGAGGGCT 
GAGCAATGCAGAGAGAGAGGTGGGCAAGGCCCTGGATGGCATCAACAGTGGAATCACGCATGC 
CGGAAGGGAAGTGGAGAAGGTTTTCAACGGACTTAGCAACATGGGGAGCCACACCGGCAAGGA 
GTTGGACAAAGGCGTCCAGGGGCTCAACCACGGCATGGACAAGGTTGCCCATGAGATCAACCA 
TGGTATTGGACAAGCAGGAAAGGAAGCAGAGAAGCTTGGCCATGGGGTCAACAACGCTGCTGG 
ACAGGCCGGGAAGGAAGCAGACAAAGCGGTCCAAGGGTTCCACACTGGGGTCCACCAGGCTGG 
GAAGGAAGCAGAGAAACTTGGCCAAGGGGTCAACCATGCTGCTGACCAGGCTGGAAAGGAAGT 
GGAGAAGCTTGGCCAAGGTGCCCACCATGCTGCTGGCCAGGCCGGGAAGGAGCTGCAGAATGC 
TCATAATGGGGTCAACCAAGCCAGCAAGGAGGCCAACCAGCTGCTGAATGGCAACCATCAAAG 
CGGATCTTCCAGCCATCAAGGAGGGGCCACAACCACGCCGTTAGCCTCTGGGGCCTCAGTCAA 
CACGCCTTTCATCAACCTTCCCGCCCTGTGGAGGAGCGTCGCCAACATCATGCCC2SAACTGG 
CATCCGGCCTTGCTGGGAGAATAATGTCGCCGTTGTCACATCAGCTGACATGACCTGGAGGGG 
TTGGGGGTGGGGGACAGGTTTCTGAAATCCCTGAAGGGGGTTGTACTGGGATTTGTGAATAAA 
CTTGATACACCA 
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FIGURE 514 

MHLARLVGSCSLLLLLGALSGWAASDDPIEKVIEGINRGLSNAEREVGKALDGINSGITHAGR 
EVEKVFNGLSNMGSHTGKELDKGVQGLNHGMDKVAHEINHGIGQAGKEAEKLGHGVNNAAGQA 
GKEADKAVQGFHTGVHQAGKEAEKLGQGVNHAADQAGKEVEKLGQGAHHAAGQAGKELQNAHN
GVNQASKEANQLLNGNHQSGSSSHQGGATTTPLASGASVNTPFINLPALWRSVANIMP 

Important features: 
Signal peptide: 

amino acids 1-25 

Homologous region to circumsporozoite (CS) repeats: 

amino acids 35-225 
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FIGURE 515 

CCCACGCGTCCGCCCACGCGTCCGGGTGCCACTCGCGCGCCGGCCGCGCTCCGGGCTTCTCTT 
TTCCCTCCGACGCGCCACGGCTGCCCAGACATTCCGGCTGCCGGGTCTGGAGAGCTCCCCGAA 
CCCCTCCGCGGAGAGGAGCGAGGCGGCGCCAGGGTGGCCCCCGGGGCGCGCTTGGTCTCGGAG 
AAGCGGGGACGAGGCCGGAGGATGAGCGACTGAGGGCGACGCGGGCACTGACGCGAGTTGGGG 
CCGCGACTACCGGCAGCTGACAGCGCGATGAGCGACTCCCCAGAGACGCCCTAGCCCGGTGTG 
CGCGCCAGGCGGAGCGCGCAGGTGGGGCfTGGGCTGTTAGTGGTCCGCCCCACGCGGGTCGCCG 
GCCGGCCCAGGATGGGCGCTGGCAACCCGGGCCCGCGCCCGCCGCTGCTACCCCTGCGCCCGC 
TGCGAGCCCGGCGTCCGGCCCGCGCCCTGCGCTCATGGACGGCGGCTCCCGGCTGGCGGCGGC 
GCGCCCCCGGGCTGTGAATGCGACTCGCCCCTCGGCCGCGCTCCCCGCCCGCCCGCCCGCCGG 
GACGTGGTAGGGGATGCCCAGCTCCACTGCGATGGCAGTTGGCGCGCTCTCCAGTTCCCTCCT 
GGTCACCTGCTGCCTGATGGTGGCTCTGTGCAGTCCGAGCATCCCGCTGGAGAAGCTGGCCCA 
GGCACCAGAGCAGCCGGGCCAGGAGAAGCGTGAGCACGCCACTCGGGACGGCCCGGGGCGGGT 
GAACGAGCTCGGGCGCCCGGCGAGGGACGAGGGCGGCAGCGGCCGGGACTGGAAGAGCAAGAG 
CGGCCGTGGGCTCGCCGGCCGTGAGCCGTGGAGCAAGCTGAAGCAGGCCTGGGTCTCCCAGGG 
CGGGGGCGCCAAGGCCGGGGATCTGCAGGTCCGGCCCCGCGGGGACACCCCGCAGGCGGAAGC 
CCTGGCCGCAGCCGCCCAGGACGCGATTGGCCCGGAACTCGCGCCCACGCCCGAGCCACCCGA 
GGAGTACGTGTACCCGGACTACCGTGGCAAGGGCTGCGTGGACGAGAGCGGCTTCGTGTACGC 
GATCGGGGAGAAGTTCGCGCCGGGCCCCTCGGCCTGCCCGTGCCTGTGCACCGAGGAGGGGCC 
GCTGTGCGCGCAGCCCGAGTGCCCGAGGCTGCACCCGCGCTGCATCCACGTCGACACGAGCCA 
GTGCTGCCCGCAGTGCAAGGAGAGGAAGAACTACTGCGAGTTCCGGGGCAAGACCTATCAGAC 
TTTGGAGGAGTTCGTGGTGTCTCCATGCGAGAGGTGTCGCTGTGAAGCCAACGGTGAGGTGCT 
ATGCACAGTGTCAGCGTGTCCCCAGACGGAGTGTGTGGACCCTGTGTACGAGCCTGATCAGTG 
CTGTCCCATCTGCAAAAATGGTCCAAACTGCTTTGCAGAAACCGCGGTGATCCCTGCTGGCAG
AGAAGTGAAGACTGACGAGTGCACCATATGCCACTGTACTTATGAGGAAGGCACATGGAGAAT 
CGAGCGGCAGGCCATGTGCACGAGACATGAATGCAGGCAAATGTAGACGCTTCCCAGAACACA 
AACTCTGACTTTTTCTAGAACATTTTACTGATGTGAACATTCTAtATGACTCTGGGAACTATC 
AGTCAAAGAAGACTTTTGATGAGGAATAATGGAAAATTGTTGGTACTTTTCCTTTTCTTGATA 
ACAGTTACTACAACAGAAGGAAATGGATATATTTCAAAACATCAACAAGAACTTTGGGCATAA 
AATCCTTCTCTAAATAAATGTGCTATTTTCACAGTAAGTACACAAAAGTACACTATTATATAT
CAAATGTATTTCTATAATCCCTCCATTAGAGAGCTTATATAAGTGTTTTCTATAGATGCAGAT 
TAAAAATGCTGTGTTGTCAACCGTCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 516 

MPSSTAMAVGALSSSLLVTCCLMVALCSPSIPLEKLAQAPEQPGQEKREHATRDGPGRVNELG 
RPARDEGGSGRDWKSKSGRGLAGREPWSKLKQAWVSQGGGAKAGDLQVRPRGDTPQAEALAAA 
AQDAIGPELAPTPEPPEEYVYPDYRGKGCVDESGFVYAIGEKFAPGPSACPCLCTEEGPLCAQ 
PECPRLHPRCIHVDTSQCCPQCKERKNYCEFRGKTYQTLEEFVVSPCERCRCEANGEVLCTVS
ACPQTECVDPVYEPDQCCPICKNGPNCFAETAVIPAGREVKTDECTICHCTYEEGTWRIERQA
MCTRHECRQM

Important features: 
Signal peptide: 

amino acids 1-27 

Transmembrane domain: 

amino acids 11-30 

Glycosaminoglycan attachment site. 

amino acids 80-83 

N-myristoylation sites.

amino acids 10-15, 102-107, 103-108

Cell attachment sequence. 

amino acids 114-117 

EGF-like domain cysteine pattern signature. 

amino acids 176-187 
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FIGURE 517 

GGACAACCGTTGCTGGGTGTCCCAGGGCCTGAGGCAGGACGGTACTCCGCTGACACCTTCCCT 
TTCGGCCTTGAGGTTCCCAGCCTGGTGGCCCCAGGACGTTCCGGTCGCATGGCAGAGTGCTAC 
GGACGACGCCTATGAAGCCCTTAGTCCTTCTAGTTGCGCTTTTGCTATGGCCTTCGTCTGTGC 
CGGCTTATCCGAGCATAACTGTGACACCTGATGAAGAGCAAAACTTGAATCATTATATACAAG 
TTTTAGAGAACCTAGTACGAAGTGTTCCCTCTGGGGAGCCAGGTCGTGAGAAAAAATCTAACT 
CTCCAAAACATGTTTATTCTATAGCATCAAAGGGATCAAAATTTAAGGAGCTAGTTACACATG 
GAGACGCTTCAACTGAGAATGATGTTTTAACCAATCCTATCAGTGAAGAAACTACAACTTTCC 
CTACAGGAGGCTTCACACCGGAAATAGGAAAGAAAAAACACACGGAAAGTACCCCATTCTGGT 
CGATCAAACCAAACAATGTTTCCATTGTTTTGCATGCAGAGGAACCTTATATTGAAAATGAAG 
AGCCAGAGCCAGAGCCGGAGCCAGCTGCAAAACAAACTGAGGCACCAAGAATGTTGCCAGTTG 
TTACTGAATCATCTACAAGTCCATATGTTACCTCATACAAGTCACCTGTCACCACTTTAGATA 
AGAGCACTGGCATTGAGATCTCTACAGAATCAGAAGATGTTCCTCAGCTCTCAGGTGAAACTG 
CGATAGAAAAACCCGAAGAGTTTGGAAAGCACCCAGAGAGTTGGAATAATGATGACATTTTGA 
AAAAAATTTTAGATATTAATTCACAAGTGCAACAGGCACTTCTTAGTGACACCAGCAACCCAG 
CATATAGAGAAGATATTGAAGCCTCTAAAGATCACCTAAAACGAAGCCTTGCTCTAGCAGCAG 
CAGCAGAACATAAATTAAAAACAATGTATAAGTCCCAGTTATTGCCAGTAGGACGAACAAGTA 
ATAAAATTGATGACATCGAAACTGTTATTAACATGCTGTGTAATTCTAGATCTAAACTCTATG 
AATATTTAGATATTAAATGTGTTCCACCAGAGATGAGAGAAAAAGCTGCTACAGTATTCAATA 
CATTAAAAAATATGTGTAGATCAAGGAGAGTCACAGCCTTATTAAAAGTTTATTAAACAATAA 
TATAAAAATTTTAAACCTACTTGATATTCCATAACAAAGCTGATTTAAGCAAACTGCATTTTT 
TCACAGGAGAAATAATCATATTCGTAATTTCAAAAGTTGTATAAAAATATTTTCTATTGTAGT 
TCAAATGTGCCAACATCTTTATGTGTCATGTGTTATGAACAATTTTCATATGCACTAAAAACC 
TAATTTAAAATAAAATTTTGGTTCAGGAAAAAA 
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FIGURE 518 

MKPLVLLVALLLWPSSVPAYPSITVTPDEEQNLNHYIQVLENLVRSVPSGEPGREKKSNSPKH 
VYSIASKGSKFKELVTHGDASTENDVLTNPISEETTTFPTGGFTPEIGKKKHTESTPFWSIKP 
NNVSIVLHAEEPYIENEEPEPEPEPAAKQTEAPRMLPVVTESSTSPYVTSYKSPVTTLDKSTG 
IEISTESEDVPQLSGETAIEKPEEFGKHPESWNNDDILKKILDINSQVQQALLSDTSNPAYRE 
DIEASKDHLKRSLALAAAAEHKLKTMYKSQLLPVGRTSNKIDDIETVINMLCNSRSKLYEYLD 
IKCVPPEMREKAATVFNTLKNMCRSRRVTALLKVY 

Important features: 
Signal peptide: 

amino acids 1-19 
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FIGURE 519 

CGGCTCGAGTGCAGCTGTGGGGAGATTTCAGTGCATTGCCTCCCCTGGGTGCTCTTCATCTTG 
GATTTGAAAGTTGAGAGCAGCATGTTTTGCCCACTGAAACTCATCCTGCTGCCAGTGTTACTG 
GATTATTCCTTGGGCCTGAATGACTTGAATGTTTCCCCGCCTGAGCTAACAGTCCATGTGGGT 
GATTCAGCTCTGATGGGATGTGTTTTCCAGAGCACAGAAGACAAATGTATATTCAAGATAGAC 
TGGACTCTGTCACCAGGAGAGCACGCCAAGGACGAATATGTGCTATACTATTACTCCAATCTC 
AGTGTGCCTATTGGGCGCTTCCAGAACCGCGTACACTTGATGGGGGACATCTTATGCAATGAT 
GGCTCTCTCCTGCTCCAAGATGTGCAAGAGGCTGACCAGGGAACCTATATCTGTGAAATCCGC 
CTCAAAGGGGAGAGCCAGGTGTTCAAGAAGGCGGTGGTACTGCATGTGCTTCCAGAGGAGCCC 
AAAGAGCTCATGGTCCATGTGGGTGGATTGATTCAGATGGGATGTGTTTTCCAGAGCACAGAA 
GTGAAACACGTGACCAAGGTAGAATGGATATTTTCAGGACGGCGCGCAAAGGAGGAGATTGTA
TTTCGTTACTACCACAAACTCAGGATGTCTGTGGAGTACTCCCAGAGCTGGGGCCACTTCCAG 
AATCGTGTGAACCTGGTGGGGGACATTTTCCGCAATGACGGTTCCATCATGCTTCAAGGAGTG 
AGGGAGTCAGATGGAGGAAACTACACCTGCAGTATCCACCTAGGGAACCTGGTGTTCAAGAAA 
ACCATTGTGCTGCATGTCAGCCCGGAAGAGCCTCGAACACTGGTGACCCCGGCAGCCCTGAGG 
CCTCTGGTCTTGGGTGGTAATCAGTTGGTGATCATTGTGGGAATTGTCTGTGCCACAATCCTG 
CTGCTCCCTGTTCTGATATTGATCGTGAAGAAGACCTGTGGAAATAAGAGTTCAGTGAATTCT 
ACAGTCTTGGTGAAGAACACGAAGAAGACTAATCCAGAGATAAAAGAAAAACCCTGCCATTTT 
GAAAGATGTGAAGGGGAGAAACACATTTACTCCCCAATAATTGTACGGGAGGTGATCGAGGAA 
GAAGAACCAAGTGAAAAATCAGAGGCCACCTACATGACCATGCACCCAGTTTGGCCTTCTCTG 
AGGTCAGATCGGAACAACTCACTTGAAAAAAAGTCAGGTGGGGGAATGCCAAAAACACAGCAA 
GCCTTTTGAGAAGAATGGAGAGTCCCTTCATCTCAGCAGCGGTGGAGACTCTCTCCTGTGTGT 
GTCCTGGGCCACTCTACCAGTGATTTCAGACTCCCGCTCTCCCAGCTGTCCTCCTGTCTCATT 
GTTTGGTCAATACACTGAAGATGGAGAATTTGGAGCCTGGCAGAGAGACTGGACAGCTCTGGA 
GGAACAGGCCTGCTGAGGGGAGGGGAGCATGGACTTGGCCTCTGGAGTGGGACACTGGCCCTG 
GGAACCAGGCTGAGCTGAGTGGCCTCAAACCCCCCGTTGGATCAGACCCTCCTGTGGGCAGGG 
TTCTTAGTGGATGAGTTACTGGGAAGAATCAGAGATAAAAACCAACCCAAATCAA 
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FIGURE 520 

MFCPLKLILLPVLLDYSLGLNDLNVSPPELTVHVGDSALMGCVFQSTEDKCIFKIDWTLSPGE 
HAKDEYVLYYYSNLSVPIGRFQNRVHLMGDILCNDGSLLLQDVQEADQGTYICEIRLKGESQV 
FKKAVVLHVLPEEPKELMVHVGGLIQMGCVFQSTEVKHVTKVEWIFSGRRAKEEIVFRYYHKL
RMSVEYSQSWGHFQNRVNLVGDIFRNDGSIMLQGVRESDGGNYTCSIHLGNLVFKKTIVLHVS 
PEEPRTLVTPAALRPLVLGGNQLVIIVGIVCATILLLPVLILIVKKTCGNKSSVNSTVLVKNT 
KKTNPEIKEKPCHFERCEGEKHIYSPIIVREVIEEEEPSEKSEATYMTMHPVWPSLRSDRNNS 
LEKKSGGGMPKTQQAF 
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FIGURE 521 

CTATGAAGAAGCTTCCTGGAAAACAATAAGCAAAGGAAAACAAATGTGTCCCATCTCACATGG
TTCTACCCTACTAAAGACAGGAAGATCATAAACTGACAGATACTGAAATTGTAAGAGTTGGAA 
ACTACATTTTGCAAAGTCATTGAACTCTGAGCTCAGTTGCAGTACTCGGGAAGCCATGCAGGA 
TGAAGATGGATACATCACCTTAAATATTAAAACTCGGAAACCAGCTCTCGTCTCCGTTGGCCC 
TGCATCCTCCTCCTGGTGGCGTGTGATGGCTTTGATTCTGCTGATCCTGTGCGTGGGGATGGT 
TGTCGGGCTGGTGGCTCTGGGGATTTGGTCTGTCATGCAGCGCAATTACCTACAAGATGAGAA 
TGAAAATCGCACAGGAACTCTGCAACAATTAGCAAAGCGCTTCTGTCAATATGTGGTAAAACA 
ATCAGAACTAAAGGGCACTTTCAAAGGTCATAAATGCAGCCCCTGTGACACAAACTGGAGATA 
TTATGGAGATAGCTGCTATGGGTTCTTCAGGCACAACTTAACATGGGAAGAGAGTAAGCAGTA 
CTGCACTGACATGAATGCTACTCTCCTGAAGATTGACAACCGGAACATTGTGGAGTACATCAA 
AGCCAGGACTCATTTAATTCGTTGGGTCGGATTATCTCGCCAGAAGTCGAATGAGGTCTGGAA 
GTGGGAGGATGGCTCGGTTATCTCAGAAAATATGTTTGAGTTTTTGGAAGATGGAAAAGGAAA 
TATGAATTGTGCTTATTTTCATAATGGGAAAATGCACCCTACCTTCTGTGAGAACAAACATTA 
TTTAATGTGTGAGAGGAAGGCTGGCATGACCAAGGTGGACCAACTACCTTAATGCAAAGAGGT 
GGACAGGATAACACAGATAAGGGCTTTATTGTACAATAAAAGATATGTATGAATGCATCAGTA 
GCTGAAAAAAAAAAAAAA 
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FIGURE 522 

MQDEDGYITLNIKTRKPALVSVGPASSSWWRVMALILLILCVGMVVGLVALGIWSVMQRNYLQ 
DENENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNLTWEES
KQYCTDMNATLLKIDNRNIVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISENMFEFLEDG 
KGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP 
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FIGURE 523 

CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATGGC 
AAAGAATCCTCCAGAGAATTGTGAAGACTGTCACATTCTAAATGCAGAAGCTTTTAAATCCAA 
GAAAATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAAT 
TGTCCTGTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGA 
GCACACTTTCTACAGCAATGGAGAGAAGAAGAAGATTTACATGGAAATTGATCCTGTGACCAG 
AACTGAAATATTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTTTAAAAA 
CGGATACACTGGCATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGT 
GATTCCTGAATTTTCTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAATTACCACAACTTT 
CTTTGAACAGTCAGTGATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAA
AAATTCCAAAATTCTGGAGATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAAT 
ATCAGTTTCTGAGTTAGAAGACTTTGAGGAGGAGGGAGAAGATCTTCACTTTCCTGCCAACGA 
AAAAAAAGGGATTGAACAAAATGAACAGTGGGTGGTCCCTGAAGTGAAAGTAGAGAAGACCCG 
TCACGCCAGACAAGCAAGTGAGGAAGAACTTCCAATAAATGACTATACTGAAAATGGAATAGA 
ATTTGATCCCATGCTGGATGAGAGAGGTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTA 
TTGCCGCCGCGTCTGTGAACCTTTACTAGGCTACTACCCATATCCATACTGCTACCAAGGAGG 
ACGAGTCATCTGTCGTGTCATCATGCCTTGTAACTGGTGGGTGGCCCGCATGCTGGGGAGGGT 
C TAA TAGGAGGTTTGAGCTCAAATGCTTAAACTGCTGGCAACATATAATAAATGCATGCTATT 
CAATGAATTTCTGCCTATGAGGCATCTGGCCCCTGGTAGCCAGCTCTCCAGAATTACTTGTAG 
GTAATTCCTCTCTTCATGTTCTAATAAACTTCTACATTATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 524 

MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYD 
MEHTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQI 
KVIPEFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPT 
LISVSELQDFEEEGEDLHFPANEKKGIEQNEQWVVPQVKVEKTRHARQASEEELPINDYTENG
IEFDPMLDERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites.

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239-242

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 525 

AGTGACAATCTCAGAGCAGCTTCTACACCACAGCCATTTCCAGCATGAAGATCACTGGGGGTC 
TCCTTCTGCTCTGTACAGTGGTCTATTTCTGTAGCAGCTCAGAAGCTGCTAGTCTGTCTCCAA 
AAAAAGTGGACTGCAGCATTTACAAGAAGTATCCAGTGGTGGCCATCCCCTGCCCCATCACAT 
ACCTACCAGTTTGTGGTTCTGACTACATCACCTATGGGAATGAATGTCACTTGTGTACCGAGA 
GCTTGAAAAGTAATGGAAGAGTTCAGTTTCTTCACGATGGAAGTTGCTAAATTCTCCATGGAC
ATAGAGAGAAAGGAATGATATTCTCATCATCATCTTCATCATCCCAGGCTCTGACTGAGTTTC 
TTTCAGTTTTACTGATGTTCTGGGTGGGGGACAGAGCCAGATTCAGAGTAATCTTGACTGAAT 
GGAGAAAGTTTCTGTGCTACCCCTACAAACCCATGCCTCACTGACAGACCAGCATTTTTTTTT 
TAACACGTCAATAAAAAAATAATCTCCCAGA 
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FIGURE 526 

MKITGGLLLLCTVVYFCSSSEAASLSPKKVDCSIYKKYPVVAIPCPITYLPVCGSDYITYGNE
CHLCTESLKSNGRVQFLHDGSC 

Important features: 
Signal peptide: 

amino acids 1-19 
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FIGURE 527 

CGACGATGCTACGCGCGCCCGGCTGCCTCCTCCGGACCTCCGTAGCGCCTGCCGCGGCCCTGG 
CTGCGGCGCTGCTCTCGTCGCTTGCGCGCTGCTCTCTTCTAGAGCCGAGGGACCCGGTGGCCT 
CGTCGCTCAGCCCCTATTTCGGCACCAAGACTCGCTACGAGGATGTCAACCCCGTGCTATTGT 
CGGGCCCCGAGGCTCCGTGGCGGGACCCTGAGCTGCTGGAGGGGACCTGCACCCCGGTGCAGC 
TGGTCGCCCTCATTCGCCACGGCACCCGCTACCCCACGGTCAAACAGATCCGCAAGCTGAGGC 
AGCTGCACGGGTTGCTGCAGGCCCGCGGGTCCAGGGATGGCGGGGCTAGTAGTACCGGCAGCC 
GCGACCTGGGTGCAGCGCTGGCCGACTGGCCTTTGTGGTACGCGGACTGGATGGACGGGCAGC 
TAGTAGAGAAGGGACGGCAGGATATGCGACAGCTGGCGCTGCGTCTGGCCTCGCTCTTCCCGG 
CCCTTTTCAGCCGTGAGAACTACGGCCGCCTGCGGCTCATCACCAGTTCCAAGCACCGCTGCA 
TGGATAGCAGCGCCGCCTTCCTGCAGGGGCTGTGGCAGCACTACCACCCTGGCTTGCCGCCGC 
CGGACGTCGCAGATATGGAGTTTGGACCTCCAACAGTTAATGATAAACTAATGAGATTTTTTG 
ATCACTGTGAGAAGTTTTTAACTGAAGTAGAAAAAAATGCTACAGCTCTTTATCACGTGGAAG 
CCTTCAAAACTGGACCAGAAATGCAGAACATTTTAAAAAAAGTTGCAGCTACTTTGCAAGTGC 
CAGTAAATGATTTAAATGCAGATTTAATTCAAGTAGCCTTTTTCACCTGTTCATTTGACCTGG 
CAATTAAAGGTGTTAAATCTCCTTGGTGTGATGTTTTTGACATAGATGATGCAAAGGTATTAG 
AATATTTAAATGATCTGAAACAATATTGGAAAAGAGGATATGGGTATACTATTAACAGTCGAT 
CCAGCTGCACCTTGTTTCAGGATATCTTTCAGCACTTGGACAAAGCAGTTGAACAGAAACAAA 
GGTCTCAGCCAATTTCTTCTCCAGTCATCCTCCAGTTTGGTCATGCAGAGACTCTTCTTCCAC 
TGCTTTCTCTCATGGGCTACTTCAAAGACAAGGAACCCCTAACAGCGTACAATTACAAAAAAC 
AAATGCATCGGAAGTTCCGAAGTGGTCTCATTGTACCTTATGCCTCGAACCTGATATTTGTGC 
TTTACCACTGTGAAAATGCTAAGACTCCTAAAGAACAATTCCGAGTGCAGATGTTATTAAATG 
AAAAGGTGTTACCTTTGGCTTACTCACAAGAAACTGTTTCATTTTATGAAGATCTGAAGAACC 
ACTACAAGGACATCCTTCAGAGTTGTCAAACCAGTGAAGAATGTGAATTAGCAAGGGCTAACA 
GTACATCTGATGAACTATGAGTAACTGAAGAACATTTTTAATTCTTTAGGAATCTGCAATGAG 
TGATTACATGCTTGTAATAGGTAGGCAATTCCTTGATTACAGGAAGCTTTTATATTACTTGAG 
TATTTCTGTCTTTTCACAGAAAAACATTGGGTTTCTCTCTGGGTTTGGACATGAAATGTAAGA 
AAAGATTTTTCACTGGAGCAGCTCTCTTAAGGAGAAACAAATCTATTTAGAGAAACAGCTGGC
CCTGCAAATGTTTACAGAAATGAAATTCTTCCTACTTATATAAGAAATCTCACACTGAGATAG 
AATTGTGATTTCATAATAACACTTGAAAAGTGCTGGAGTAACAAAATATCTCAGTTGGACCAT 
CCTTAACTTGATTGAACTGTCTAGGAACTTTACAGATTGTTCTGCAGTTCTCTCTTCTTTTCC 
TCAGGTAGGACAGCTCTAGCATTTTCTTAATCAGGAATATTGTGGTAAGCTGGGAGTATCACT 
CTGGAAGAAAGTAACATCTCCAGATGAGAATTTGAAACAAGAAACAGAGTGTTGTAAAAGGAC 
ACCTTCACTGAAGCAAGTCGGAAAGTACAATGAAAATAAATATTTTTGGTATTTATTTATGAA 
ATATTTGAACATTTTTTCAATAATTCCTTTTTACTTCTAGGAAGTCTCAAAAGACCATCTTAA 
ATTATTATATGTTTGGACAATTAGCAACAAGTCAGATAGTTAGAATCGAAGTTTTTCAAATCC 
ATTGCTTAGCTAACTTTTTCATTCTGTCACTTGGCTTCGATTTTTATATTTTCCTATTATATG
AAATGTATCTTTTGGTTGTTTGATTTTTCTTTCTTTCTTTGTAAATAGTTCTGAGTTCTGTCA 
AATGCCGTGAAAGTATTTGCTATAATAAAGAAAATTCTTGTGACTTTAAAAAAAAA 
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FIGURE 528 

MLRAPGCLLRTSVAPAAALAAALLSSLARCSLLEPRDPVASSLSPYFGTKTRYEDVNPVLLSG 
PEAPWRDPELLEGTCTPVQLVALIRHGTRYPTVKQIRKLRQLHGLLQARGSRDGGASSTGSRD 
LGAALADWPLWYADWMDGQLVEKGRQDMRQLALRLASLFPALFSRENYGRLRLITSSKHRCMD 
SSAAFLQGLWQHYHPGLPPPDVADMEFGPPTVNDKLMRFFDHCEKFLTEVEKNATALYHVEAF 
KTGPEMQNILKKVAATLQVPVNDLNADLIQVAFFTCSFDLAIKGVKSPWCDVFDIDDAKVLEY 
LNDLKQYWKRGYGYTINSRSSCTLFQDIFQHLDKAVEQKQRSQPISSPVILQFGHAETLLPLL 
SLMGYFKDKEPLTAYNYKKQMHRKFRSGLIVPYASNLIFVLYHCENAKTPKEQFRVQMLLNEK 
VLPLAYSQETVSFYEDLKNHYKDILQSCQTSEECELARANSTSDEL 

Important features: 
Signal sequence 

amino acids 1-30 

N-glycosylation sites.

amino acids 242-246, 481-485 

N-myristoylation sites.

amino acids 107-113, 113-119, 117-123, 118-124, 128-134

Endoplasmic reticulum targeting sequence. 

amino acids 484-489
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FIGURE 529 

GGAGAGCCGCGGCTGGGACCGGAGTGGGGAGCGCGGCGTGGAGGTGCCACCCGGCGCGGGTGG 
CGGAGAGATCAGAAGCCTCTTCCCCAAGCCGAGCCAACCTCAGCGGGGACCCGGGCTCAGGGA 
CGCGGCGGCGGCGGCGGCGACTGCAGTGGCTGGACGATGGCAGCGTCCGCCGGAGCCGGGGCG 
GTGATTGCAGCCCCAGACAGCCGGCGCTGGCTGTGGTCGGTGCTGGCGGCGGCGCTTGGGCTC 
TTGACAGCTGGAGTATCAGCCTTGGAAGTATATACGCCAAAAGAAATCTTCGTGGCAAATGGT 
ACACAAGGGAAGCTGACCTGCAAGTTCAAGTCTACTAGTACGACTGGCGGGTTGACCTCAGTC 
TCCTGGAGCTTCCAGCCAGAGGGGGCCGACACTACTGTGTCGTTTTTCCACTACTCCCAAGGG 
CAAGTGTACCTTGGGAATTATCCACCATTTAAAGACAGAATCAGCTGGGCTGGAGACCTTGAC 
AAGAAAGATGCATCAATCAACATAGAAAATATGCAGTTTATACACAATGGCACCTATATCTGT
GATGTCAAAAACCCTCCTGACATCGTTGTCCAGCGTGGACACATTAGGCTCTATGTCGTAGAA 
AAAGAGAATTTGCCTGTGTTTCCAGTTTGGGTAGTGGTGGGCATAGTTACTGCTGTGGTCCTA
GGTCTCACTCTGCTCATCAGCATGATTCTGGCTGTCCTCTATAGAAGGAAAAACTCTAAACGG 
GATTACACTGGCTGCAGTACATCAGAGAGTTTGTCACCAGTTAAGCAGGCTCCTCGGAAGTCC 
CCCTCCGACACTGAGGGTCTTGTAAAGAGTCTGCCTTCTGGATCTCACCAGGGCCCAGTCATA 
TATGCACAGTTAGACCACTCCGGCGGACATCACAGTGACAAGATTAACAAGTCAGAGTCTGTG 
GTGTATGCGGATATCCGAAAGAAT TAA GAGAATACCTAGAACATATCCTCAGCAAGAAACAAA 
ACCAAACTGGACTCTCGTGCAGAAAATGTAGCCCATTACCACATGTAGCCTTGGAGACCCAGG 
CAAGGACAAGTACACGTGTACTCACAGAGGGAGAGAAAGATGTGTACAAAGGATATGTATAAA 
TATTCTATTTAGTCATCCTGATATGAGGAGCCAGTGTTGCATGATGAAAAGATGGTATGATTC 
TACATATGTACCCATTGTCTTGCTGTTTTTGTACTTTCTTTTCAGGTCATTTACAATTGGGAG 
ATTTCAGAAACATTCCTTTCACCATCATTTAGAAATGGTTTGCCTTAATGGAGACAATAGCAG 
ATCCTGTAGTATTTCCAGTAGACATGGCCTTTTAATCTAAGGGCTTAAGACTGATTAGTCTTA 
GCATTTACTGTAGTTGGAGGATGGAGATGCTATGATGGAAGCATACCCAGGGTGGCCTTTAGC 
ACAGTATCAGTACCATTTATTTGTCTGCCGCTTTTAAAAAATACCCATTGGCTATGCCACTTG 
AAAACAATTTGAGAAGTTTTTTTGAAGTTTTTCTCACTAAAATATGGGGCAATTGTTAGCCTT 
ACATGTTGTGTAGACTTACTTTAAGTTTGCACCCTTGAAATGTGTCATATCAATTTCTGGATT 
CATAATAGCAAGATTAGCAAAGGATAAATGCCGAAGGTCACTTCATTCTGGACACAGTTGGAT 
CAATACTGATTAAGTAGAAAATCCAAGCTTTGCTTGAGAACTTTTGTAACGTGGAGAGTAAAA
AGTATCGGTTTTA 
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FIGURE 530 

MAASAGAGAVIAAPDSRRWLWSVLAAALGLLTAGVSALEVYTPKEIFVANGTQGKLTCKFKST 
STTGGLTSVSWSFQPEGADTTVSFFHYSQGQVYLGNYPPFKDRISWAGDLDKKDASINIENMQ 
FIHNGTYICDVKNPPDIVVQPGHIRLYVVEKENLPVFPVWVVVGIVTAVVLGLTLLISMILAV
LYRRKNSKRDYTGCSTSESLSPVKQAPRKSPSDTEGLVKSLPSGSHQGPVIYAQLDHSGGHHS
DKINKSESVVYADIRKN

Important features:
Signal peptide:

amino acids 1-37

Transmembrane domain:

amino acids 161-183 
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FIGURE 531 

GTGACACTATAGAAGAGCTATGACGTCGCATGCACGCGTACGTAAGCTCGGAATTCGGCTCGA 
GGCTGGTGGGAAGAAGCCGAGATGGCGGCAGCCAGCGCTGGGGCAACCCGGCTGCTCCTGCTC 
TTGCTGATGGCGGTAGCAGCGCCCAGTCGAGCCGGGGGCAGCGGCTGCCGGGCCGGGACTGGT 
GCGCGAGGGGCTGGGGCGGAAGGTCGAGAGGGCGAGGCCTGTGGCACGGTGGGGCTGCTGCTG 
GAGCACTCATTTGAGATCGATGACAGTGCCAACTTCCGGAAGCGGGGCTCACTGCTCTGGAAC 
CAGCAGGATGGTACCTTGTCCCTGTCACAGCGGCAGCTCAGCGAGGAGGAGCGGGGCCGACTC 
CGGGATGTGGCAGCCCTGAATGGCCTGTACCGGGTCCGGATCCCAAGGCGACCCGGGGCCCTG 
GATGGCCTGGAAGCTGGTGGCTATGTCTCCTCCTTTGTCCCTGCGTGCTCCCTGGTGGAGTCG 
CACCTGTCGGACCAGCTGACCCTGCACGTGGATGTGGCCGGCAACGTGGTGGGCGTGTCGGTG 
GTGACGCACCCCGGGGGCTGCCGGGGCCATGAGGTGGAGGACGTGGACCTGGAGCTGTTCAAC 
ACCTCGGTGCAGCTGCAGCCGCCCACCACAGCCCCAGGCCCTGAGACGGCGGCCTTCATTGAG 
CGCCTGGAGATGGAACAGGCCCAGAAGGCCAAGAACCCCCAGGAGCAGAAGTCCTTCTTCGCC 
AAATACTGGATGTACATCATTCCCGTCGTCCTGTTCCTCATGATGTCAGGAGCGCCAGACACC 
GGGGGCCAGGGTGGGGGTGGGGGTGGGGGTGGTGGTGGGGGTAGTGGCCTTTGCTGTGTGCCA 
CCCTCCCTGTAAGTCTATTTAAAAACATCGACGATACATTGAAATGTGTGAACGTTTTGAAAA 
GCTACAGCTTCCAGCAGCCAAAAGCAACTGTTGTTTTGGCAAGACGGTCCTGATGTACAAGCT 
TGATTGAAATTCACTGCTCACTTGATACGTTATTCAGAAACCCAAGGAATGGCTGTCCCCATC 
CTCATGTGGCTGTGTGGAGCTCAGCTGTGTTGTGTGGCAGTTTATTAAACTGTCCCCCAGATC 
GACACGCAAAAAAAAA 
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FIGURE 532 

MAAASAGATRLLLLLLMAVAAPSRARGSGCRAGTGARGAGAEGREGEACGTVGLLLEHSFEID 
DSANFRKRGSLLWNQQDGTLSLSQRQLSEEERGRLRDVAALNGLYRVRIPRRPGALDGLEAGG 
YVSSFVPACSLVESHLSDQLTLHVDVAGNVVGVSVVTHPGGCRGHEVEDVDLELFNTSVQLQP
PTTAPGPETAAFIERLEMEQAQKAKNPQEQKSFFAKYWMYIIPVVLFLMMSGAPDTGGQGGGG
GGGGGGGSGLCCVPPSL 

Important features: 
Signal peptide: 

amino acids 1-24

Transmembrane domain:

amino acids 226-243 
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FIGURE 533 

TCTGCCTCCACTGCTCTGTGCTGGGATCATGGAACTTGCACTGCTGTGTGGGCTGGTGGTGAT
GGCTGGTGTGATTCCAATCCAGGGCGGGATCCTGAACCTGAACAAGATGGTCAAGCAAGTGAC
TGGGAAAATGCCCATCCTCTCCTACTGGCCCTACGGCTGTCACTGCGGACTAGGTGGCAGAGG
CCAACCCAAAGATGCCACGGACTGGTGCTGCCAGACCCATGACTGCTGCTATGACCACCTGAA
GACCCAGGGGTGCGGCATCTACAAGGACAACAACAAAAGCAGCATACATTGTATGGATTTATC
TCAACGCTATTGTTTAATGGCTGTGTTTAATGTGATCTATCTGGAAAATGAGGACTCCGAATA
AAAAGCTATTACTAWTTNAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
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FIGURE 534 

MELALLCGLVVMAGVIPIQGGILNLNKMVKQVTGKMPILSYWPYGCHCGLGGRGQPKDATDWC 
CQTHDCCYDHLKTQGCGIYKDNNKSSIHCMDLSQRYCLMAVFNVIYLENEDSE 

Important features: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 1-24 

N-glycosylation site. 

amino acids 86-89 

N-myristoylation sites.

amino acids 20-25, 45-50 

Phospholipase A2 histidine active site. 

amino acids 63-70 
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FIGURE 535 

GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTG 
AACTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCC 
CCAAATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTT
TTTTTAACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCAT 
GAAGATCCTATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTT 
GTTCTTGGAGTGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAG 
GGGTCCAATTTTTCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGG 
CTGTCATGCAACTGGCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAA 
AGGATGGGTTTCAATGTAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCC 
ACTGTCTTACTGACAATGCTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAA 
GGCAAAATGGTATATTGTGAATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGT 
TGCTTAGGTTTGTCCCTTCGCTATAACAGCCTTCAAAAACTTAAGTATAATCAATTTAAAGGG 
CTCAACCAGCTCACCTGGCTATACCTTGACCATAACCATATCAGCAATATTGACGAAAATGCT 
TTTAATGGAATACGCAGACTCAAAGAGCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTT 
AACAATACCTTCAGACCTGTGACAAATTTACGGAACTTGGATCTGTCCTATAATCAGCTGCAT 
TCTCTGGGATCTGAACAGTTTCGGGGCTTGCGGAAGCTGCTGAGTTTACATTTACGGTCTAAC 
TCCCTGAGAACCATCCCTGTGCGAATATTCCAAGACTGCCGCAACCTGGAACTTTTGGACCTG 
GGATATAACCGGATCCGAAGTTTAGCCAGGAATGTCTTTGCTGGCATGATCAGACTCAAAGAA 
CTTCACCTGGAGCACAATCAATTTTCCAAGCTCAACCTGGCGCTTTTTCCAAGGTTGGTCAGC 
CTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGTCATAGGACAGACCATGTCCTGGACC 
TGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGATCGAAGCTTTCAGTGGACCCAGT 
GTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGATTCCAACAAGCTCACATTTATT
GGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAGTCTTGCTGGGAATATATGG 
GAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTTTTAAAGGTCTAAGGGAG 
AATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTGATCGATGCAGTGAAG 
AACTACAGCATCTGTGGCAAAAGTACTACAGAGAGGTTTGATCTGGCCAGGGCTCTCCCAAAG 
CCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTTTGCCCCCGACG 
GTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCTTTCCATAAA 
ATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTATCTACGTG 
TCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAAGGCAC
AGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGACTCCCAGCACCCAGGAATTTTATGTAGAT 
TATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCTGCACCTAT
AACAAATCGGGCTCCAGGGAGTGTGAGGTATGAACCATTGTGATAAAAAGAGCTCTTAAAAGC 
TGGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTC 
CCCTTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATA 
ATACTGGTCATTTTCCTCTCATACATAATCAACCCATTGAAATTTAAATACCACAATCAATGT 
GAAGCTTGAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAAT 
GTCGCATTTGTTTTAAGATAAAACTTCTTTCATAGGTAAAAAAAAAAA 
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FIGURE 536 

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGC 
LGLSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLN 
NTFRPVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLG 
YNRIRSLARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTW 
SSLQRLDLSGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWE 
CSRNICSLVNWLKSFKGLRENTIICASPKELQGVNVIDAVKNYSICGKSTTERFDLARALPKP 
TFKPKLPRPKHESKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVS 
WKRYPASMKQLQQRSLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYN 
KSGSRECEV 

Important features: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites.

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512
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FIGURE 537 



GGGACTACAAGCCGCGCCGCGCTGCCGCTGGCCCCTCAGCAACCCTCGACATGGCGCTGAGGCGGCCACCGCGAC
TCCGGCTCTGCGCTCGGCTGCCTGACTTCTTCCTGCTGCTGCTTTTCAGGGGCTGCCTGATAGGGGCTGTAAATC 
TCAAATCCAGCAATCGAACCCCAGTGGTACAGGAATTTGAAAGTGTGGAACTGTCTTGCATCATTACGGATTCGC 
AGACAAGTGACCCCAGGATCGAGTGGAAGAAAATTCAAGATGAACAAACCACATATGTGTTTTTTGACAACAAAA 
TTCAGGGAGACTTGGCGGGTCGTGCAGAAATACTGGGGAAGACATCCCTGAAGATCTGGAATGTGACACGGAGAG 
ACTCAGCCCTTTATCGCTGTGAGGTCGTTGCTCGAAATGACCGCAAGGAAATTGATGAGATTGTGATCGAGTTAA 
CTGTGCAAGTGAAGCCAGTGACCCCTGTCTGTAGAGTGCCGAAGGCTGTACCAGTAGGCAAGATGGCAACACTGC 
ACTGCCAGGAGAGTGAGGGCCACCCCCGGCCTCACTACAGCTGGTATCGCAATGATGTACCACTGCCCACGGATT 
CCAGAGCCAATCCCAGATTTCGCAATTCTTCTTTCCACTTAAACTCTGAAACAGGCACTTTGGTGTTCACTGCTG 
TTCACAAGGACGACTCTGGGCAGTACTACTGCATTGCTTCCAATGACGCAGGCTCAGCCAGGTGTGAGGAGCAGG 
AGATGGAAGTCTATGACCTGAACATTGGCGGAATTATTGGGGGGGTTCTGGTTGTCCTTGCTGTACTGGCCCTGA 
TCACGTTGGGCATCTGCTGTGCATACAGACGTGGCTACTTCATCAACAATAAACAGGATGGAGAAAGTTACAAGA 
ACCCAGGGAAACCAGATGGAGTTAACTACATCCGCACTGACGAGGAGGGCGACTTCAGACACAAGTCATCGTTTG 
TGATCTGAGACCCGCGGTGTGGCTGAGAGCGCACAGAGCGCACGTGCACATACCTCTGCTAGAAACTCCTGTCAA 
GGCAGCGAGAGCTGATGCACTCGGACAGAGCTAGACACTCATTCAGAAGCTTTTCGTTTTGGCCAAAGTTGACCA 
CTACTCTTCTTACTCTAACAAGCCACATGAATAGAAGAATTTTCCTCAAGATGGACCCGGTAAATATAACCACAA 
GGAAGCGAAACTGGGTGCGTTCACTGAGTTGGGTTCCTAATCTGTTTCTGGCCTGATTCCCGCATGAGTATTAGG 
GTGATCTTAAAGAGTTTGCTCACGTAAACGCCCGTGCTGGGCCCTGTGAAGCCAGCATGTTCACCACTGGTCGTT 
CAGCAGCCACGACAGCACCATGTGAGATGGCGAGGTGGCTGGACAGCACCAGCAGCGCATCCCGGCGGGAACCCA 
GAAAAGGCTTCTTACACAGCAGCCTTACTTCATCGGCCCACAGACACCACCGCAGTTTCTTCTTAAAGGCTCTGC 
TGATCGGTGTTGCAGTGTCCATTGTGGAGAAGCTTTTTGGATCAGCATTTTGTAAAAACAACCAAAATCAGGAAG 
GTAAATTGGTTGCTGGAAGAGGGATCTTGCCTGAGGAACCCTGCTTGTCCAACAGGGTGTCAGGATTTAAGGAAA 
ACCTTCGTCTTAGGCTAAGTCTGAAATGGTACTGAAATATGCTTTTCTATGGGTCTTGTTTATTTTATAAAATTT 
TACATCTAAATTTTTGCTAAGGATGTATTTTGATTATTGAAAAGAAAATTTCTATTTAAACTGTAAATATATTGT 
CATACAATGTTAAATAACCTATTTTTTTAAAAAAGTTCAACTTAAGGTAGAAGTTCCAAGCTACTAGTGTTAAAT 
TGGAAAATATCAATAATTAAGAGTATTTTACCCAAGGAATCCTCTCATGGAAGTTTACTGTGATGTTCCTTTTCT
CACACAAGTTTTAGCCTTTTTCACAAGGGAACTCATACTGTCTACACATCAGACCATAGTTGCTTAGGAAACCTT 
TAAAAATTCCAGTTAAGCAATGTTGAAATCAGTTTGCATCTCTTCAAAAGAAACCTCTCAGGTTAGCTTTGAACT 
GCCTCTTCCTGAGATGACTAGGACAGTCTGTACCCAGAGGCCACCCAGAAGCCCTCAGATGTACATACACAGATG 
CCAGTCAGCTCCTGGGGTTGCGCCAGGCGCCCCCGCTCTAGCTCACTGTTGCCTCGCTGTCTGCCAGGAGGCCCT 
GCCATCCTTGGGCCCTGGCAGTGGCTGTGTCCCAGTGAGCTTTACTCACGTGGCCCTTGCTTCATCCAGCACAGC 
TCTCAGGTGGGCACTGCAGGGACACTGGTGTCTTCCATGTAGCGTCCCAGCTTTGGGCTCCTGTAACAGACCTCT 
TTTTGGTTATGGATGGCTCACAAAATAGGGCCCCCAATGCTATTTTTTTTTTTTAAGTTTGTTTAATTATTTGTT 
AAGATTGTCTAAGGCCAAAGGCAATTGCGAAATCAAGTCTGTCAAGTACAATAACATTTTTAAAAGAAAATGGAT 
CCCACTGTTCCTCTTTGCCACAGAGAAAGCACCCAGACGCCACAGGCTCTGTCGCATTTCAAAACAAACCATGAT 
GGAGTGGCGGCCAGTCCAGCCTTTTAAAGAACGTCAGGTGGAGCAGCCAGGTGAAAGGCCTGGCGGGGAGGAAAG 
TGAAACGCCTGAATCAAAAGCAGTTTTCTAATTTTGACTTTAAATTTTTCATCCGCCGGAGACACTGCTCCCATT 
TGTGGGGGGACATTAGCAACATCACTCAGAAGCCTGTGTTCTTCAAGAGCAGGTGTTCTCAGCCTCACATGCCCT 
GCCGTGCTGGACTCAGGACTGAAGTGCTGTAAAGCAAGGAGCTGCTGAGAAGGAGCACTCCACTGTGTGCCTGGA 
GAATGGCTCTCACTACTCACCTTGTCTTTCAGCTTCCAGTGTCTTGGGTTTTTTATACTTTGACAGCTTTTTTTT 
AATTGCATACATGAGACTGTGTTGACTTTTTTTAGTTATGTGAAACACTTTGCCGCAGGCCGCCTGGCAGAGGCA 
GGAAATGCTCCAGCAGTGGCTCAGTGCTCCCTGGTGTCTGCTGCATGGCATCCTGGATGCTTAGCATGCAAGTTC 
CCTCCATCATTGCCACCTTGGTAGAGAGGGATGGCTCCCCACCCTCAGCGTTGGGGATTCACGCTCCAGCCTCCT 
TCTTGGTTGTCATAGTGATAGGGTAGCCTTATTGCCCCCTCTTCTTATACCCTAAAACCTTCTACACTAGTGCCA 
TGGGAACCAGGTCTGAAAAAGTAGAGAGAAGTGAAAGTAGAGTCTGGGAAGTAGCTGCCTATAACTGAGACTAGA 
CGGAAAAGGAATACTCGTGTATTTTAAGATATGAATGTGACTCAAGACTCGAGGCCGATACGAGGCTGTGATTCT 
GCCTTTGGATGGATGTTGCTGTACACAGATGCTACAGACTTGTACTAACACACCGTAATTTGGCATTTGTTTAAC 
CTCATTTATAAAAGCTTCAAAAAAACCCA 
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FIGURE 538 

MALRRPPRLRLCARLPDFFLLLLFRGCLIGAVNLKSSNRTPVVQEFESVELSCIITDSQTSDP 
RIEWKKIQDEQTTYVFFDNKIQGDLAGRAEILGKTSLKIWNVTRRDSALYRCEVVARNDRKEI 
DEIVIELTVQVKPVTPVCRVPKAVPVGKMATLHCQESEGHPRPHYSWYRNDVPLPTDSRANPR 
FRNSSFHLNSETGTLVFTAVHKDDSGQYYCIASNDAGSARCEEQEMEVYDLNIGGIIGGVLVV 
LAVLALITLGICCAYRRGYFINNKQDGESYKNPGKPDGVNYIRTDEEGDFRHKSSFVI 

Important features: 
Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 243-263 

N-glycosylation sites. 

amino acids 104-107, 192-195 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 107-110 

Casein kinase II phosphorylation site. 

amino acids 106-109, 296-299 

Tyrosine kinase phosphorylation site. 

amino acids 69-77 

N-myristoylation sites. 

amino acids 26-31, 215-220, 226-231, 243-248, 244-249, 262-267 
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FIGURE 539 

CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGC 
GCAACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAA 
CTTCAGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGG 
CACAGGCGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGG 
GCTCGGGCGGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGG 
CTGCGCCCTGGGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGA 
TGAGCCGCGTGGTCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCC 
GCCGCGTGGTCAGCGGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGG 
CCTACTTCCATGAACTGTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTG 
AGGGAGGAGTCCTCCTCAGCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGC 
AAAACCTGACAAAACCCGGGACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGA 
ATGGAGATGGGCAAACATCTGGTGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATT 
CCCAGTACCGAAACTGGTACACAGATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGT 
ATCACCAACCAACTGCCAATCCTGGCCTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACA 
GGTGTAACATGAAGCACAATTATATTTGCAAGTATGAACCAGAGATTAATCCAACAGCCCCTG 
TAGAAAAGCCTTATCTTACAAATCAACCAGGAGACACCCATCAGAATGTGGTTGTTACTGAAG 
CAGGTATAATTCCCAATCTAATTTATGTTGTTATACCAACAATACCCCTGCTCTTACTGATAC 
TGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCTGCATAAAAGTAAAGGAAGAACAAAAACTA 
GTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTACCAGAAAAGAAAGTGGCATGGAAGTAT 
AATAACTCATTGACTTGGTTCCAGAATTTTGTAATTCTGGATCTGTATAAGGAATGGCATCAG 
AACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGCAAGATGAACTGTAAGCTCCCCCT 
TGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCATTTAAAGAATATGCTGTGCTA 
AT/^TGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAAACTTCAAACTTCAAGCAAA 
TGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTATGTGTGTTAGAAGCAAT 
TCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAATGTATATTGTATTGA 
AATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAAATGAACTGTTCTA 
ATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAGAAACTTATTAC 
TGTTGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGTTTTCTCGAA 
ATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGAAATAAGA 
AGCTATTTCATTAAGTGTGATATAAACCTCCTCAAACATTTTACTTAGAGGCAAGGATTGTCT 
AATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATTTTG 
TAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAGT 
GACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTG 
AGGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTT 
GGGATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGC 
CTCTGACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAG 
TGTGGAGACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAA 
ATGGGTTGGAACCCATCAGTGATCGCATATTCATTGATGAC^GGTTTGCTTGAGATAGAAAATG 
GTGGGTCCTTTCTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGA 
AAGTTGTAACTCTCTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTG 
TTTCTGGGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 540 

MSRVVSLLLGAALLCGHGAFCRRVVSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACES 
EGGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSN 
SQYRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAP 
VEKPYLTNQPGDTHQNVVVTEAGIIPNLIYVVIPTIPLLLLILVAFGTCCFQMLHKSKGRTKT 
SPNQSTLWISKSTRKESGMEV 

Important features: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89 and 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 



N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140-145 
and 212-217 
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FIGURE 541 

GGAGAATGGAGAGAGCAGTGAGAGTGGAGTCCGGGGTCCTGGTCGGGGTGGTCTGTCTGCTCCTGGCATGCCCTG 
CCACAGCCACTGGGCCCGAAGTTGCTCAGCCTGAAGTAGACACCACCCTGGGTCGTGTGCGAGGCCGGCAGGTGG 
GCGTGAAGGGCACAGACCGCCTTGTGAATGTCTTTCTGGGCATTCCATTTGCCCAGCCGCCACTGGGCCCTGACC 
GGTTCTCAGCCCCACACCCAGCACAGCCCTGGGAGGGTGTGCGGGATGCCAGCACTGCGCCCCCAATGTGCCTAC 
AAGACGTGGAGAGCATGAACAGCAGCAGATTTGTCCTCAACGGAAAACAGCAGATCTTCTCCGTTTCAGAGGACT 
GCCTGGTCCTCAACGTCTATAGCCCAGCTGAGGTCCCCGCAGGGTCCGGTAGGCCGGTCATGGTATGGGTCCATG 
GAGGCGCTCTGATAACTGGCGCTGCCACCTCCTACGATGGATCAGCTCTGGCTGCCTATGGGGATGTGGTCGTGG 
TTACAGTCCAGTACCGCCTTGGGGTCCTTGGCTTCTTCAGCACTGGAGATGAGCATGCACCTGGCAACCAGGGCT 
TCCTAGATGTGGTAGCTGCTTTGCGCTGGGTGCAAGAAAACATCGCCCCCTTCGGGGGTGACCTCAACTGTGTCA 
CTGTCTTTGGTGGATCTGCCGGTGGGAGCATCATCTCTGGCCTGGTCCTGTCCCCAGTGGCTGCAGGGCTGTTCC 
ACAGAGCCATCACACAGAGTGGGGTCATCACCACCCCAGGGATCATCGACTCTCACCCTTGGCCCCTAGCTCAGA 
AAATCGCAAACACCTTGGCCTGCAGCTCCAGCTCCCCGGCTGAGATGGTGCAGTGCCTTCAGCAGAAAGAAGGAG 
AAGAGCTGGTCCTTAGCAAGAAGCTGAAAAATACTATCTATCCTCTCACCGTTGATGGCACTGTCTTCCCCAAAA 
GCCCCAAGGAACTCCTGAAGGAGAAGCCCTTCCACTCTGTGCCCTTCCTCATGGGTGTCAACAACCATGAGTTCA 
GCTGGCTCATCCCCAGGGGCTGGGGTCTCCTGGATACAATGGAGCAGATGAGCCGGGAGGACATGCTGGCCATCT 
CAACACCCGTCTTGACCAGTCTGGATGTGCCCCCTGAGATGATGCCCACCGTCATAGATGAATACCTAGGAAGCA 
ACTCGGACGCACAAGCCAAATGCCAGGCGTTCCAGGAATTCATGGGTGACGTATTCATCAATGTTCCCACCGTCA 
GTTTTTCAAGATACCTTCGAGATTCTGGAAGCCCTGTCTTTTTCTATGAGTTCCAGCATCGACCCAGTTCTTTTG 
CGAAGATCAAACCTGCCTGGGTGAAGGCTGATCATGGGGCCGAGGGTGCTTTTGTGTTCGGAGGTCCCTTCCTCA 
TGGACGAGAGCTCCCGCCTGGCCTTTCCAGAGGCCACAGAGGAGGAGAAGCAGCTAAGCCTCACCATGATGGCCC 
AGTGGACCCACTTTGCCCGGACAGGGGACCCCAATAGCAAGGCTCTGCCTCCTTGGCCCCAATTCAACCAGGCGG 
AACAATATCTGGAGATCAACGCAGTGCCACGGGCCGGACAGAAGTTCAGGGAGGCCTGGATGCAGTTCTGGTCAG 
AGACGCTCCCCAGCAAGATACAACAGTGGCACCAGAAGCAGAAGAACAGGAAGGCCCAGGAGGACCTCTGAGGCC 
AGGCCTGAACCTTCTTGGCTGGGGCAAACCACTCTTCAAGTGGTGGCAGAGTCCCAGCACGGCAGCCCGCCTCTC 
CCCCTGCTGAGACTTTAATCTCCACCAGCCCTTAAAGTGTCGGCCGCTCTGTGACTGGAGTTATGCTCTTTTGAA 
ATGTCACAAGGCCGCCTCCCACCTCTGGGGCATTGTACAAGTTCTTCCCTCTCCCTGAAGTGCCTTTCCTGCTTT 
CTTCGTGGTAGGTTCTAGCACATTCCTCTAGCTTCCTGGAGGACTCACTCCCCAGGAAGCCTTCCCTGCCTTCTC 
TGGGCTGTGCGGCCCCGAGTCTGCGTCCATTAGAGCACAGTCCACCCGAGGCTAGCACCGTGTGTGTGTCTGTCT 
CCCCCTCAGAGGAGCTCTCTCAAAATGGGGATTAGCCTAACCCCACTCTGTCACCCACACCAGGATCGGGTGGGA 
CCTGGAGCTAGGGGGTGTTTGCTGAGTGAGTGAGTGAAACACAGAATATGGGAATGGCAGCTGCTGAACTTGAAC 
CCAGAGCCTTCAGGTGCCAAAGCCATACTCAGGCCCCCACCGACATTGTCCACCCTGGCCAGAAGGGTGCATGCC 
AATGGCAGAGACCTGGGATGGGAGAAGTCCTGGGGCGCCAGGGGATCCAGCCTAGAGCAGACCTTAGCCCCTGAC 
TAAGGCCTCAGACTAGGGCGGGAGGGGTCTCCTCCTCTCTGCTGCCCAGTCCTGGCCCCTGCACAAGACAACAGA 
ATCCATCAGGGCCATGAGTGTCACCCAGACCTGACCCTCACCAATTCCAGCCCCTGACCCTCAGGACGCTGGATG 
CCAGCTCCCAGCCCCAGTGCCGGGTCCTCCCTCCCTTCCTGGCTTGGGGAGACCAGTTTCTGGGGAGCTTCCAAG 
AGCACCCACCAAGACACAGCAGGACAGGCCAGGGGAGGGCATCTGGACCAGGGCATCCGTCGGGCTATTGTCACA 
GAGAAAAGAAGAGACCCACCCACTCGGGCTGCAAAAGGTGAAAAGCACCAAGAGGTTTTCAGATGGAAGTGAGAG 
GTGACAGTGTGCTGGCAGCCCTCACAGCCCTCGCTTGCTCTCCCTGCCGCCTCTGCCTGGGCTCCCACTTTGGCA 
GCACTTGAGGAGCCCTTCAACCCGCCGCTGCACTGTAGGAGCCCCTTTCTGGGCTGGCCAAGGCCGGAGCCAGCT 
CCCTCAGCTTGCGGGGAGGTGCGGAGGGAGAGGGGCGGGCAGGAACCGGGGCTGCGCGCAGCGCTTGCGGGCCAG 
AGTGAGTTCCGGGTGGGCGTGGGCTCGGCGGGGCCCCACTCAGAGCAGCTGGCCGGCCCCAGGCAGTGAGGGCCT 
TAGCACCTGGGCCAGCAGCTGCTGTGCTCGATTTCTCGCTGGGCCTTAGCTGGCTCCCCGCGGGGCAGGGCTCGG 
GACCTGCAGCCCTCCATGCCTGACCCTCCCCCCACCCCCCGTGGGCTCCTGTGCGGCCGGAGCCTCCCCAAGGAG 
CGCCGCCCCCTGCTCCACAGCGCCCAGTCCCATCGACCACCCAAGGGCTGAGGAGTGCGGGTGCACAGCGCGGGA 
CTGGCAGGCAGCTCCACCTGCTGCCCCAGTGCTGGATCCACTGGGTGAAGCCAGCTGGGCTCCTGAGTCTGGTGG 
GGACTTGGAGAACCTTTATGTCTAGCTAAGGGATTGTAAATACACCGATGGGCACTCTGTATCTAGCTCAAGGTT 
TGTAAACACACCAATCAGCACCCTGTGTCTAGCTCAGTGTTTGTGAATGCACCAATCCACACTCTGTATCTGGCT 
ACTCTGGTGGGGACTTGGAGAACCTTTGTGTCCACACTCTGTATCTAGCTAATCTAGTGGGGATGTGGAGAACCT 
TTGTGTCTAGCTCAGGGATCGTAAACGCACCAATCAGCACCCTGTCAAAACAGACCACTTGACTCTCTGTAAAAT 
GGACCAATCAGCAGGATGTGGGTGGGGCGAGACAAGAGAATAAAAGCAGGCTGCCTGAGCCAGCAGTGACAACCC 
CCCTCGGGTCCCCTCCCACGCCGTGGAAGCTTTGTTCTTTCGCTCTTTGCAATAAATCTTGCTACTGCCCAAAA 
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FIGURE 542 

MERAVRVESGVLVGVVCLLLACPATATGPEVAQPEVDTTLGRVRGRQVGVKGTDRLVNVFLGI 
PFAQPPLGPDRFSAPHPAQPWEGVRDASTAPPMCLQDVESMNSSRFVLNGKQQIFSVSEDCLV 
LNVYSPAEVPAGSGRPVMVWVHGGALITGAATSYDGSALAAYGDVVVVTVQYRLGVLGFFSTG 
DEHAPGNQGFLDVVAALRWVQENIAPFGGDLNCVTVFGGSAGGSIISGLVLSPVAAGLFHRAI 
TQSGVITTPGIIDSHPWPLAQKIANTLACSSSSPAEMVQCLQQKEGEELVLSKKLKNTIYPLT 
VDGTVFPKSPKELLKEKPFHSVPFLMGVNNHEFSWLIPRGWGLLDTMEQMSREDMLAISTPVL 
TSLDVPPEMMPTVIDEYLGSNSDAQAKCQAFQEFMGDVFINVPTVSFSRYLRDSGSPVFFYEF 
QHRPSSFAKIKPAWVKADHGAEGAFVFGGPFLMDESSRLAFPEATEEEKQLSLTMMAQWTHFA 
RTGDPNSKALPPWPQFNQAEQYLEINPVPRAGQKFREAWMQFWSETLPSKIQQWHQKQKNRKA 
QEDL 

Important features: 
Signal peptide: 

amino acids 1-27 

Transmembrane domain: 

amino acids 226-245 

N-glycosylation site. 

amino acids 105-109 

N-myristoylation sites. 

amino acids 10-16, 49-55, 62-68, 86-92, 150-156, 155-161, 
162-168, 217-223, 227-233, 228-234, 232-238, 262-268, 357-363, 
461-467 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 12-23 

Carboxylesterases type-B serine active site. 

amino acids 216-232 
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FIGURE 543 



TGTCGCCTGGCCCTCGCCATGCAGACCCCGCGAGCGTCCCCTCCCCGCCCGGCCCTCCTGCTTCTGCTGCTGCTA 
CTGGGGGGCGCCCACGGCCTCTTTCCTGAGGAGCCGCCGCCGCTTAGCGTGGCCCCCAGGGACTACCTGAACCAC 
TATCCCGTGTTTGTGGGCAGCGGGCCCGGACGCCTGACCCCCGCAGAAGGTGCTGACGACCTCAACATCCAGCGA 
GTCCTGCGGGTCAACAGGACGCTGTTCATTGGGGACAGGGACAACCTCTACCGCGTAGAGCTGGAGCCCCCCACG 
TCCACGGAGCTGCGGTACCAGAGGAAGCTGACCTGGAGATCTAACCCCAGCGACATAAACGTGTGTCGGATGAAG 
GGCAAACAGGAGGGCGAGTGTCGAAACTTCGTAAAGGTGCTGCTCCTTCGGGACGAGTCCACGCTCTTTGTGTGC 
GGTTCCAACGCCTTCAACCCGGTGTGCGCCAACTACAGCATAGACACCCTGCAGCCCGTCGGAGACAACATCAGC 
GGTATGGCCCGCTGCCCGTACGACCCCAAGCACGCCAATGTTGCCCTCTTCTCTGACGGGATGCTCTTCACAGCT 
ACTGTTACCGACTTCCTAGCCATTGATGCTGTCATCTACCGCAGCCTCGGGGACAGGCCCACCCTGCGCACCGTG 
AAACATGACTCCAAGTGGTTCAAAGAGCCTTACTTTGTCCATGCGGTGGAGTGGGGCAGCCATGTCTACTTCTTC 
TTCCGGGAGATTGCGATGGAGTTTAACTACCTGGAGAAGGTGGTGGTGTCCCGCGTGGCCCGAGTGTGCAAGAAC 
GACGTGGGAGGCTCCCCCCGCGTGCTGGAGAAGCAGTGGACGTCCTTCCTGAAGGCGCGGCTCAACTGCTCTGTA 
CCCGGAGACTCCCATTTCTACTTCAACGTGCTGCAGGCTGTCACGGGCGTGGTCAGCCTCGGGGGCCGGCCCGTG 
GTCCTGGCCGTTTTTTCCACGCCCAGCAACAGCATCCCTGGCTCGGCTGTCTGCGCCTTTGACCTGACACAGGTG 
GCAGCTGTGTTTGAAGGCCGCTTCCGAGAGCAGAAGTCCCCCGAGTCCATCTGGACGCCGGTGCCGGAGGATCAG 
GTGCCTCGACCCCGGCCCGGGTGCTGCGCAGCCCCCGGGATGCAGTACAATGCCTCCAGCGCCTTGCCGGATGAC 
ATCCTCAACTTTGTCAAGACCCACCCTCTGATGGACGAGGCGGTGCCCTCGCTGGGCCATGCGCCCTGGATCCTG 
CGGACCCTGATGAGGCACCAGCTGACTCGAGTGGCTGTGGACGTGGGAGCCGGCCCCTGGGGCAACCAGACCGTT 
GTCTTCCTGGGTTCTGAGGCGGGGACGGTCCTCAAGTTCCTCGTCCGGCCCAATGCCAGCACCTCAGGGACGTCT 
GGGCTCAGTGTCTTCCTGGAGGAGTTTGAGACCTACCGGCCGGACAGGTGTGGACGGCCCGGCGGTGGCGAGACA 
GGGCAGCGGCTGCTGAGCTTGGAGCTGGACGCAGCTTCGGGGGGCCTGCTGGCTGCCTTCCCCCGCTGCGTGGTC 
CGAGTGCCTGTGGCTCGCTGCCAGCAGTACTCGGGGTGTATGAAGAACTGTATCGGCAGTCAGGACCCCTACTGC 
GGGTGGGCCCCCGACGGCTCCTGCATCTTCCTCAGCCCGGGCACCAGAGCCGCCTTTGAGCAGGACGTGTCCGGG 
GCCAGCACCTCAGGCTTAGGGGACTGCACAGGACTCCTGCGGGCCAGCCTCTCCGAGGACCGCGCGGGGCTGGTG 
TCGGTGAACCTGCTGGTAACGTCGTCGGTGGCGGCCTTCGTGGTGGGAGCCGTGGTGTCCGGCTTCAGCGTGGGC 
TGGTTCGTGGGCCTCCGTGAGCGGCGGGAGCTGGCCCGGCGCAAGGACAAGGAGGCCATCCTGGCGCACGGGGCG 
GGCGAGGCGGTGCTGAGCGTCAGCCGCCTGGGCGAGCGCAGGGCGCAGGGTCCCGGGGGCCGGGGCGGAGGCGGT 
GGCGGTGGCGCCGGGGTTCCCCCGGAGGCCCTGCTGGCGCCCCTGATGCAGAACGGCTGGGCCAAGGCCACGCTG 
CTGCAGGGCGGGCCCCACGACCTGGACTCGGGGCTGCTGCCCACGCCCGAGCAGACGCCGCTGCCGCAGAAGCGC 
CTGCCCACTCCGCACCCGCACCCCCACGCCCTGGGCCCCCGCGCCTGGGACCACGGCCACCCCCTGCTCCCGGCC 
TCCGCTTCATCCTCCCTCCTGCTGCTGGCGCCCGCCCGGGCCCCCGAGCAGCCCCCCGCGCCTGGGGAGCCGACC 
CCCGACGGCCGCCTCTATGCTGCCCGGCCCGGCCGCGCCTCCGACGGCGACTTCCCGCTCACCCCCCACGCCAGC 
CCGGACCGCCGGCGGGTGGTGTCCGCGCCCACGGGCCCCTTGGACCCAGCCTCAGCCGCCGATGGCCTCCCGCGG 
CCCTGGAGCCCGCCCCCGACGGGCAGCCTGAGGAGGCCACTGGGCCCCCACGCCCCTCCGGCCGCCACCCTGCGC 
CGCACCCACACGTTCAACAGCGGCGAGGCCCGGCCTGGGGACCGCCACCGCGGCTGCCACGCCCGGCCGGGCACA 
GACTTGGCCCACCTCCTCCCCTATGGGGGGGCGGACAGGACTGCGCCCCCCGTGCCCTAGGCCGGGGGCCCCCCG 
ATGCCTTGGCAGTGCCAGCCACGGGAACCAGGAGCGAGAGACGGTGCCAGAACGCCGGGGCCCGGGGCAACTCCG 
AGTGGGTGCTCAAGTCCCCCCCGCGACCCACCCGCGGAGTGGGGGGCCCCCTCCGCCACAAGGAAGCACAACCAG 
CTCGCCCTCCCCCTACCCGGGGCCGCAGGACGCTGAGACGGTTTGGGGGTGGGTGGGCGGGAGGACTTTGCTATG 
GATTTGAGGTTGACCTTATGCGCGTAGGTTTTGGTTTTTTTTTGCAGTTTTGGTTTCTTTTGCGGTTTTCTAACC 
AATTGCACAACTCCGTTCTCGGGGTGGCGGCAGGCAGGGGAGGCTTGGACGCCGGTGGGGAATGGGGGGCCACAG 
CTGCAGACCTAAGCCCTCCCCCACCCCTGGAAAGGTCCCTCCCCAACCCAGGCCCCTGGCGTGTGTGGGTGTGCG 
TGCGTGTGCGTGCCGTGTTCGTGTGCAAGGGGCCGGGGAGGTGGGCGTGTGTGTGCGTGCCAGCGAAGGCTGCTG 
TGGGCGTGTGTGTCAAGTGGGCCACGCGTGCAGGGTGTGTGTCCACGAGCGACGATCGTGGTGGCCCCAGCGGCC 
TGGGCGTTGGCTGAGCCGACGCTGGGGCTTCCAGAAGGCCCGGGGGTCTCCGAGGTGCCGGTTAGGAGTTTGAAC 
CCCCCCCACTCTGCAGAGGGAAGCGGGGACAATGCCGGGGTTTCAGGCAGGAGACACGAGGAGGGCCTGCCCGGA 
AGTCACATCGGCAGCAGCTGTCTAAAGGGCTTGGGGGCCTGGGGGGCGGCGAAGGTGGGTGGGGCCCCTCTGTAA 
ATACGGCCCCAGGGTGGTGAGAGAGTCCCATGCCACCCGTCCCCTTGTGACCTCCCCCCTATGACCTCCAGCTGA 
CCATGCATGCCACGTGGCTGGCTGGGTCCTCTGCCCTCTTTGGAGTTTGCCTCCCCCAGCCCCCTCCCCATCAAT 
AAAACTCTGTTTACAACCAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 544 

MQTPRASPPRPALLLLLLLLGGAHGLFPEEPPPLSVAPRDYLNHYPVFVGSGPGRLTPAEGAD 
DLNIQRVLRVNRTLFIGDRDNLYRVELEPPTSTELRYQRKLTWRSNPSDINVCRMKGKQEGEC 
RNFVKVLLLRDESTLFVCGSNAFNPVCANYSIDTLQPVGDNISGMARCPYDPKHANVALFSDG 
MLFTATVTDFLAIDAVIYRSLGDRPTLRTVKHDSKWFKEPYFVHAVEWGSHVYFFFREIAMEF 
NYLEKVVVSRVARVCKNDVGGSPRVLEKQWTSFLKARLNCSVPGDSHFYFNVLQAVTGVVSLG 
GRPVVLAVFSTPSNSIPGSAVCAFDLTQVAAVFEGRFREQKSPESIWTPVPEDQVPRPRPGCC 
AAPGMQYNASSALPDDILNFVKTHPLMDEAVPSLGHAPWILRTLMRHQLTRVAVDVGAGPWGN 
QTVVFLGSEAGTVLKFLVRPNASTSGTSGLSVFLEEFETYRPDRCGRPGGGETGQRLLSLELD 
AASGGLLAAFPRCVVRVPVARCQQYSGCMKNCIGSQDPYCGWAPDGSCIFLSPGTRAAFEQDV 
SGASTSGLGDCTGLLRASLSEDRAGLVSVNLLVTSSVAAFVVGAVVSGFSVGWFVGLRERREL 
ARRKDKEAILAHGAGEAVLSVSRLGERRAQGPGGRGGGGGGGAGVPPEALLAPLMQNGWAKAT 
LLQGGPHDLDSGLLPTPEQTPLPQKRLPTPHPHPHALGPRAWDHGHPLLPASASSSLLLLAPA 
RAPEQPPAPGEPTPDGRLYAARPGRASHGDFPLTPHASPDRRRVVSAPTGPLDPASAADGLPR 
PWSPPPTGSLRRPLGPHAPPAATLRRTHTFNSGEARPGDRHRGCHARPGTDLAHLLPYGGADR 
TAPPVP 



Important features: 
Signal peptide: 
amino acids 1-25 
Transmembrane domains: 
amino acids 318-339, 598-617 
N-glycosylation sites. 

amino acids 74-78, 155-159, 167-171, 291-295, 386-390, 441-445, 
462-466 

Glycosaminoglycan attachment sites. 

amino acids 51-55, 573-577 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 102-106 
N-myristoylation sites. 

amino acids 21-27, 50-56, 189-195, 333-339, 382-388, 448-454, 
490-496, 491-497, 508-514, 509-515, 531-537, 558-564, 569-575, 
574-580, 580-586, 610-616, 643-649, 663-669, 666-672, 667-673, 
668-674, 669-675, 670-676, 868-874, 879-885 
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FIGURE 545 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGG 
GACCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAA 
ACAAGTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTT 
CCAGGCCTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACG 
ACCTGTGCCACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAG 
GCTTGGCAGTTTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGT 
CTTGCCTTCAGCCTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAG 
ACACTCAATTTGGGAAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCT 
GAGATACGGGGCAGTGTGCAAGCCAAAGATGGAAACATTGACATCAGAATCTTAAGGAGGACT 
GAGTCTTTGCAAGACACAAAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTC 
TATCTGGACAGGGTATTTAAAAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGC 
AGCCTCGCCAATTCCTTTCTTACCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACA 
TGCCATTGTGGGGAGGAAGCAATGAAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTG 
GAACCTCAGGCAGCAGTTGTGAAGGCTTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAG 
GAGACAGAATAGGAGGAAAGTGATGCTGCTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCT 
TCAATACCTGCAGAGGAGGCATGACCCCAAACCACCATCTCTTTACTGTACTAGTCTTGTGCT 
GGTCACAGTGTATCTTATTTATGCATTACTTGCTTCCTTGCATGATTGTCTTTATGCATCCCC 
AATCTTAATTGAGACCATACTTGTATAAGATTTTTGTAATATCTTTCTGCTATTGGATATATT 
TATTAGTTAATATATTTATTTATTTTTTGCTATTTAATGTATTTATTTTTTTACTTGGACATG 
AAACTTTAAAAAAATTCACAGATTATATTTATAACCTGACTAGAGCAGGTGATGTATTTTTAT 
ACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGGCTAGGGGGGTTATTCATTTGTAT 
TCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGATATTTGAAATTGAACCAATGAC 
TACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCACATCTACCTTACAATTACTG 
ACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAGCCAGGAATCCTACACGG 
CCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAAAAAAATAAAAA 
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FIGURE 546 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPE 
IFSSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQE 
IRNGFSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHY 
TLRKISSLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDI 
LLQWMEETE 

Important features: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 547 

AGCAACTCAAGTTCATCATTGTCCTGAGAGAGAGGAGCAGCGCGGTTCTCGGCCGGGACAGCA 
GAACGCCAGGGGACCCTCACCTGGGCGCGCCGGGGCACGGGCTTTGATTGTCCTGGGGTCGCG 
GAGACCCGCGCGCCTGCCCTGCACGCCGGGCGGCAACCTTTGCAGTCGCGTTGGCTGCTGCGA 
TCGGCCGGCGGGTCCCTGCCGAAGGCTCGGCTGCTTCTGTCCACCTCTTACACTTCTTCATTT 
ATCGGTGGATCATTTCGAGAGTCCGTCTTGTAAATGTTTGGCACTTTGCTACTTTATTGCTTC 
TTTCTGGCGACAGTTCCAGCACTCGCCGAGACCGGCGGAGAAAGGCAGCTGAGCCCGGAGAAG 
AGCGAAATATGGGGACCCGGGCTAAAAGCAGACGTCGTCCTTCCCGCCCGCTATTTCTATATT 
CAGGCAGTGGATACATCAGGGAATAAATTCACATCTTCTCCAGGCGAAAAGGTCTTCCAGGTG 
AAAGTCTCAGCACCAGAGGAGCAATTCACTAGAGTTGGAGTCCAGGTTTTAGACCGAAAAGAT 
GGGTCCTTCATAGTAAGATACAGAATGTATGCAAGCTACAAAAATCTGAAGGTGGAAATTAAA 
TTCCAAGGGCAACATGTGGCCAAATCCCCATATATTTTAAAAGGGCCGGTTTACCATGAGAAC 
TGTGACTGTCCTCTGCAAGATAGTGCAGCCTGGCTACGGGAGATGAACTGCCCTGAAACCATT 
GCTCAGATTCAGAGAGATCTGGCACATTTCCCTGCTGTGGATCCAGAAAAGATTGCAGTAGAA 
ATCCCAAAAAGATTTGGACAGAGGCAGAGCCTATGTCACTACACCTTAAAGGATAACAAGGTT 
TATATCAAGACTCATGGTGAACATGTAGGTTTTAGAATTTTCATGGATGCCATACTACTTTCT 
TTGACTAGAAAGGTGAAGATGCCAGATGTGGAGCTCTTTGTTAATTTGGGAGACTGGCCTTTG 
GAAAAAAAGAAATCCAATTCAAACATCCATCCGATCTTTTCCTGGTGTGGCTCCACAGATTCC 
AAGGATATCGTGATGCCTACGTACGATTTGACTGATTCTGTTCTGGAAACCATGGGCCGGGTA 
AGTCTGGATATGATGTCCGTGCAAGCTAACACGGGTCCTCCCTGGGAAAGCAAAAATTCCACT 
GCCGTCTGGAGAGGGCGAGACAGCCGCAAAGAGAGACTCGAGCTGGTTAAACTCAGTAGAAAA 
CACCCAGAACTCATAGACGCTGCTTTCACCAACTTTTTCTTCTTTAAACACGATGAAAACCTG 
TATGGTCCCATTGTGAAACATATTTCATTTTTTGATTTCTTCAAGCATAAGTATCAAATAAAT 
ATCGATGGCACTGTAGCAGCTTATCGCCTGCCATATTTGCTAGTTGGTGACAGTGTTGTGCTG 
AAGCAGGATTCCATCTACTATGAACATTTTTACAATGAGCTGCAGCCCTGGAAACACTACATT 
CCAGTTAAGAGCAACCTGAGCGATCTGCTAGAAAAACTTAAATGGGCGAAAGATCACGATGAA 
GAGGCCAAAAAGATAGCAAAAGCAGGACAAGAATTTGCAAGAAATAATCTCATGGGCGATGAC 
ATATTCTGTTATTATTTCAAACTTTTCCAGGAATATGCCAATTTACAAGTGAGTGAGCCCCAA 
ATCCGAGAGGGCATGAAAAGGGTAGAACCACAGACTGAGGACGACCTCTTCCCTTGTACTTGC 
CATAGGAAAAAGACCAAAGATGAACTCTGATATGCAAAATAACTTCTATTAGAATAATGGTGC 
TCTGAAGACTCTTCTTAACTAAAAAGAAGAATTTTTTTAAGTATTAATTCCATGGACAATATA 
AAATCTGTGTGATTGTTTGCAGTATGAAGACACATTTCTACTTATGCAGTATTCTCATGACTG 
TACTTTAAAGTACATTTTTAGAATTTTATAATAAAACCACCTTTATTTTAAAGGAAAAAAA 
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MFGTLLLYCFFLATVPALAETGGERQLSPEKSEIWGPGLKADVVLPARYFYIQAVDTSGNKFT 
SSPGEKVFQVKVSAPEEQFTRVGVQVLDRKDGSFIVRYRMYASYKNLKVEIKFQGQHVAKSPY 
ILKGPVYHENCDCPLQDSAAWLREMNCPETIAQIQRDLAHFPAVDPEKIAVEIPKRFGQRQSL 
CHYTLKDNKVYIKTHGEHVGFRIFMDAILLSLTRKVKMPDVELFVNLGDWPLEKKKSNSNIHP 
IFSWCGSTDSKDIVMPTYDLTDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKE 
RLELVKLSRKHPELIDAAFTNFFFFKHDENLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLP 
YLLVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLKWAKDHDEEAKKIAKAGQE 
FARNNLMGDDIFCYYFKLFQEYANLQVSEPQIREGMKRVEPQTEDDLFPCTCHRKKTKDEL 

Important features: 
Signal peptide: 

amino acids 1-17 

N-glycosylation sites. 

amino acids 302-306, 414-418 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 243-247, 495-499 

Tyrosine kinase phosphorylation site. 

amino acids 341-348 

N-myristoylation sites. 

amino acids 59-65, 118-124, 184-190, 258-264, 370-376, 439-445 

Endoplasmic reticulum targeting sequence. 

amino acids 499-504 



WO 01/40466 

PCT/US00/32678 

549/550 
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GGGTGATTGAACTAAACCTTCGCCGCACCGAGTTTGCAGTACGGCCGTCACCCGCACCGCTGC 
CTGCTTGCGGTTGGAGAAATCAAGGCCCTACCGGGCCTCCGTAGTCACCTCTCTATAGTGGGC 
GTGGCCGAGGCCGGGGTGACCCTGCCGGAGCCTCCGCTGCCAGCGACATGTTCAAGGTAATTC 
AGAGGTCCGTGGGGCCAGCCAGCCTGAGCTTGCTCACCTTCAAAGTCTATGCAGCACCAAAAA 
AGGACTCACCTCCCAAAAATTCCGTGAAGGTTGATGAGCTTTCACTCTACTCAGTTCCTGAGG 
GTCAATCGAAGTATGTGGAGGAGGCAAGGAGCCAGCTTGAAGAAAGCATCTCACAGCTCCGAC 
ACTATTGCGAGCCATACACAACCTGGTGTCAGGAAACGTACTCCCAAACTAAGCCCAAGATGC 
AAAGTTTGGTTCAATGGGGGTTAGACAGCTATGACTATCTCCAAAATGCACCTCCTGGATTTT 
TTCCGAGACTTGGTGTTATTGGTTTTGCTGGCCTTATTGGACTCCTTTTGGCTAGAGGTTCAA 
AAATAAAGAAGCTAGTGTATCCGCCTGGTTTCATGGGATTAGCTGCCTCCCTCTATTATCCAC 
AACAAGCCATCGTGTTTGCCCAGGTCAGTGGGGAGAGATTATATGACTGGGGTTTACGAGGAT 
ATATAGTCATAGAAGATTTGTGGAAGGAGAACTTTCAAAAGCCAGGAAATGTGAAGAATTCAC 
CTGGAACTAAGTAGAAAACTCCATGCTCTGCCATCTTAATCAGTTATAGGTAAACATTGGAAA 
CTCCATAGAATAAATCAGTATTTCTACAGAAAAATGGCATAGAAGTCAGTATTGAATGTATTA 
AATTGGCTTTCTTCTTCAGGAAAAACTAGACCAGACCTCTGTTATCTTCTGTGAAATCATCCT 
ACAAGCAAACTAACCTGGAATCCCTTCACCTAGAGATAATGTACAAGCCTTAGAACTCCTCAT 
TCTCATGTTGCTATTTATGTACCTAATTAAAACCCAAGTTTAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAA 
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FIGURE 550 

MFKVIQRSVGPASLSLLTFKVYAAPKKDSPPKNSVKVDELSLYSVPEGQSKYVEEARSQLEES 
ISQLRHYCEPYTTWCQETYSQTKPKMQSLVQWGLDSYDYLQNAPPGFFPRLGVIGFAGLIGLL 
LARGSKIKKLVYPPGFMGLAASLYYPQQAIVFAQVSGERLYDWGLRGYIVIEDLWKENFQKPG 
NVKNSPGTK 

Important features: 
Signal peptide: 

Amino acids 1-23 

Transmembrane domain: 

Amino acids 111-130 

cAMP- and cGMP-dependent protein kinase phosphorylation site: 

Amino acids 26-30 

Tyrosine kinase phosphorylation site: 

Amino acids 36-44 

N-myristoylation sites: 

Amino acids 124-130; 144-150; 189-195 



